H1.6 top features

Top feature 0 in H1.6: (feature 12667

TOP ACTIVATIONS
MAX = 2.828

ve
Tokenve
Feature activation+0.000
taken
Token taken
Feature activation+0.000
that
Token that
Feature activation+0.000
phrase
Token phrase
Feature activation+0.128
and
Token and
Feature activation+0.000
applied
Token applied
Feature activation+2.828
it
Token it
Feature activation+0.011
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
sport
Token sport
Feature activation+0.000
and
Token and
Feature activation+0.000
The
TokenThe
Feature activation+0.000
word
Token word
Feature activation+0.000
karma
Token karma
Feature activation+0.000
is
Token is
Feature activation+0.000
also
Token also
Feature activation+0.000
used
Token used
Feature activation+2.726
in
Token in
Feature activation+0.169
different
Token different
Feature activation+0.197
contexts
Token contexts
Feature activation+0.817
.
Token.
Feature activation+0.000
Y
Token Y
Feature activation+0.011
dep
Token dep
Feature activation+0.000
raved
Tokenraved
Feature activation+0.000
words
Token words
Feature activation+0.000
,
Token,
Feature activation+0.000
were
Token were
Feature activation+0.000
spoken
Token spoken
Feature activation+2.443
by
Token by
Feature activation+0.000
then
Token then
Feature activation+0.000
âĢĵ
TokenâĢĵ
Feature activation+0.000
secret
Tokensecret
Feature activation+0.000
ary
Tokenary
Feature activation+0.000
"
Token "
Feature activation+0.597
el
Tokenel
Feature activation+0.000
k
Tokenk
Feature activation+0.000
"
Token"
Feature activation+0.477
is
Token is
Feature activation+0.000
used
Token used
Feature activation+2.423
in
Token in
Feature activation+0.000
North
Token North
Feature activation+0.000
America
Token America
Feature activation+0.000
to
Token to
Feature activation+0.000
refer
Token refer
Feature activation+2.387
most
Token most
Feature activation+0.000
general
Token general
Feature activation+0.061
sense
Token sense
Feature activation+0.380
,
Token,
Feature activation+0.000
karma
Token karma
Feature activation+0.000
refers
Token refers
Feature activation+2.411
to
Token to
Feature activation+0.000
any
Token any
Feature activation+0.000
action
Token action
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
a
Token a
Feature activation+0.000
Greek
Token Greek
Feature activation+0.000
word
Token word
Feature activation+0.593
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
means
Token means
Feature activation+2.399
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
love
Tokenlove
Feature activation+0.000
of
Token of
Feature activation+0.000
wisdom
Token wisdom
Feature activation+0.000
used
Token used
Feature activation+2.423
in
Token in
Feature activation+0.000
North
Token North
Feature activation+0.000
America
Token America
Feature activation+0.000
to
Token to
Feature activation+0.000
refer
Token refer
Feature activation+2.387
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
animal
Token animal
Feature activation+0.000
,
Token,
Feature activation+0.000
,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
what
Token what
Feature activation+0.000
it
Token it
Feature activation+0.000
means
Token means
Feature activation+2.326
in
Token in
Feature activation+0.026
a
Token a
Feature activation+0.000
loose
Token loose
Feature activation+0.000
translation
Token translation
Feature activation+1.249
is
Token is
Feature activation+0.190
police
Token police
Feature activation+0.000
genocide
Token genocide
Feature activation+0.000
(
Token (
Feature activation+0.000
the
Tokenthe
Feature activation+0.000
word
Token word
Feature activation+0.000
used
Token used
Feature activation+2.127
by
Token by
Feature activation+0.000
a
Token a
Feature activation+0.000
Cornell
Token Cornell
Feature activation+0.000
professor
Token professor
Feature activation+0.000
recently
Token recently
Feature activation+0.000
a
Tokena
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
word
Token word
Feature activation+0.000
of
Token of
Feature activation+0.000
Greek
Token Greek
Feature activation+2.080
origin
Token origin
Feature activation+0.705
that
Token that
Feature activation+0.000
signifies
Token signifies
Feature activation+2.005
something
Token something
Feature activation+0.000
being
Token being
Feature activation+0.000
word
Token word
Feature activation+0.000
of
Token of
Feature activation+0.000
Greek
Token Greek
Feature activation+2.080
origin
Token origin
Feature activation+0.705
that
Token that
Feature activation+0.000
signifies
Token signifies
Feature activation+2.005
something
Token something
Feature activation+0.000
being
Token being
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
short
Token short
Feature activation+0.000
The
Token The
Feature activation+0.000
word
Token word
Feature activation+0.000
bud
Token bud
Feature activation+0.000
d
Tokend
Feature activation+0.000
ha
Tokenha
Feature activation+0.000
means
Token means
Feature activation+1.918
awakened
Token awakened
Feature activation+0.000
one
Token one
Feature activation+0.000
.
Token.
Feature activation+0.000
One
Token One
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000
of
Token of
Feature activation+0.000
conditional
Token conditional
Feature activation+0.000
phr
Token phr
Feature activation+0.000
asing
Tokenasing
Feature activation+1.825
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
According
TokenAccording
Feature activation+0.000
:
Token:
Feature activation+0.000
What
Token What
Feature activation+0.000
would
Token would
Feature activation+0.000
be
Token be
Feature activation+0.000
the
Token the
Feature activation+0.000
meaning
Token meaning
Feature activation+1.809
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Christian
Token Christian
Feature activation+0.000
gospel
Token gospel
Feature activation+0.000
if
Token if
Feature activation+0.000
Em
TokenEm
Feature activation+0.000
pt
Tokenpt
Feature activation+0.000
ying
Tokenying
Feature activation+0.000
words
Token words
Feature activation+1.141
of
Token of
Feature activation+0.000
meaning
Token meaning
Feature activation+1.745
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
essential
Token essential
Feature activation+0.000
step
Token step
Feature activation+0.000
on
Token on
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
vague
Token vague
Feature activation+0.000
term
Token term
Feature activation+0.000
often
Token often
Feature activation+0.000
used
Token used
Feature activation+1.690
to
Token to
Feature activation+0.000
laud
Token laud
Feature activation+0.000
making
Token making
Feature activation+0.000
outs
Token outs
Feature activation+0.000
as
Token as
Feature activation+0.000
:
Token:
Feature activation+0.000
What
Token What
Feature activation+0.000
word
Token word
Feature activation+0.000
do
Token do
Feature activation+0.000
you
Token you
Feature activation+0.000
use
Token use
Feature activation+1.630
?
Token?
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.075
Some
TokenSome
Feature activation+0.000
's
Token's
Feature activation+0.000
watching
Token watching
Feature activation+0.000
the
Token the
Feature activation+0.000
words
Token words
Feature activation+0.000
they
Token they
Feature activation+0.000
use
Token use
Feature activation+1.622
and
Token and
Feature activation+0.000
they
Token they
Feature activation+0.000
should
Token should
Feature activation+0.000
be
Token be
Feature activation+0.000
,
Token,
Feature activation+0.000
les
Tokenles
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
,
Token,
Feature activation+0.000
both
Token both
Feature activation+0.000
meaning
Token meaning
Feature activation+1.615
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.000
one
Token one
Feature activation+0.000
irritated
Token irritated
Feature activation+0.000
or
Token or
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Those
Token Those
Feature activation+0.000
words
Token words
Feature activation+0.000
were
Token were
Feature activation+0.000
spoken
Token spoken
Feature activation+1.584
by
Token by
Feature activation+0.000
America
Token America
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000

Top DFA by src position
MAX = 3.345

âĢ
TokenâĢ
Feature activation+0.004
Top resid features:
Ļ
TokenĻ
Feature activation+0.026
Top resid features:
ve
Tokenve
Feature activation+0.006
Top resid features:
taken
Token taken
Feature activation+0.027
Top resid features:
that
Token that
Feature activation+0.089
Top resid features:
phrase
Token phrase
Feature activation+3.345
Top resid features:
and
Token and
Feature activation+0.065
Top resid features:
applied
Token applied
Feature activation+0.067
Top resid features:
it
Token it
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
ata
Tokenata
Feature activation+0.010
Top resid features:
.
Token.
Feature activation+0.002
Top resid features:
Ċ
TokenĊ
Feature activation+0.024
Top resid features:
Ċ
TokenĊ
Feature activation+0.027
Top resid features:
The
TokenThe
Feature activation+0.016
Top resid features:
word
Token word
Feature activation+3.294
Top resid features:
karma
Token karma
Feature activation+0.078
Top resid features:
is
Token is
Feature activation+0.134
Top resid features:
also
Token also
Feature activation+0.053
Top resid features:
used
Token used
Feature activation+0.090
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation-0.001
Top resid features:
Ŀ
TokenĿ
Feature activation+0.031
Top resid features:
Ċ
TokenĊ
Feature activation+0.009
Top resid features:
Ċ
TokenĊ
Feature activation+0.017
Top resid features:
Those
TokenThose
Feature activation+0.040
Top resid features:
words
Token words
Feature activation+2.648
Top resid features:
,
Token,
Feature activation+0.044
Top resid features:
dep
Token dep
Feature activation+0.022
Top resid features:
raved
Tokenraved
Feature activation+0.031
Top resid features:
words
Token words
Feature activation+0.871
Top resid features:
,
Token,
Feature activation+0.081
Top resid features:
Conf
TokenConf
Feature activation-0.001
Top resid features:
using
Tokenusing
Feature activation+0.036
Top resid features:
ly
Tokenly
Feature activation+0.020
Top resid features:
,
Token,
Feature activation+0.024
Top resid features:
the
Token the
Feature activation+0.091
Top resid features:
word
Token word
Feature activation+3.163
Top resid features:
"
Token "
Feature activation+0.116
Top resid features:
el
Tokenel
Feature activation-0.013
Top resid features:
k
Tokenk
Feature activation-0.007
Top resid features:
"
Token"
Feature activation-0.014
Top resid features:
is
Token is
Feature activation+0.220
Top resid features:
ata
Tokenata
Feature activation-0.003
Top resid features:
.
Token.
Feature activation-0.002
Top resid features:
Ċ
TokenĊ
Feature activation+0.008
Top resid features:
Ċ
TokenĊ
Feature activation+0.007
Top resid features:
The
TokenThe
Feature activation+0.006
Top resid features:
word
Token word
Feature activation+3.207
Top resid features:
karma
Token karma
Feature activation+0.010
Top resid features:
is
Token is
Feature activation+0.001
Top resid features:
also
Token also
Feature activation-0.005
Top resid features:
used
Token used
Feature activation-0.001
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
os
Tokenos
Feature activation+0.008
Top resid features:
ophy
Tokenophy
Feature activation+0.014
Top resid features:
is
Token is
Feature activation+0.014
Top resid features:
a
Token a
Feature activation+0.029
Top resid features:
Greek
Token Greek
Feature activation+0.085
Top resid features:
word
Token word
Feature activation+2.980
Top resid features:
,
Token,
Feature activation+0.059
Top resid features:
which
Token which
Feature activation+0.253
Top resid features:
means
Token means
Feature activation+0.033
Top resid features:
âĢ
Token âĢ
Feature activation+0.000
Top resid features:
ľ
Tokenľ
Feature activation+0.000
Top resid features:
Conf
TokenConf
Feature activation-0.009
Top resid features:
using
Tokenusing
Feature activation+0.013
Top resid features:
ly
Tokenly
Feature activation+0.008
Top resid features:
,
Token,
Feature activation+0.002
Top resid features:
the
Token the
Feature activation+0.072
Top resid features:
word
Token word
Feature activation+3.188
Top resid features:
"
Token "
Feature activation+0.070
Top resid features:
el
Tokenel
Feature activation-0.013
Top resid features:
k
Tokenk
Feature activation-0.016
Top resid features:
"
Token"
Feature activation-0.024
Top resid features:
is
Token is
Feature activation+0.028
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.187
Top resid features:
hear
Token hear
Feature activation+0.069
Top resid features:
the
Token the
Feature activation+0.101
Top resid features:
word
Token word
Feature activation+2.654
Top resid features:
âĢ
Token âĢ
Feature activation+0.042
Top resid features:
ĺ
Tokenĺ
Feature activation+0.027
Top resid features:
com
Tokencom
Feature activation-0.014
Top resid features:
pre
Tokenpre
Feature activation+0.041
Top resid features:
hens
Tokenhens
Feature activation+0.053
Top resid features:
white
Token white
Feature activation+0.025
Top resid features:
police
Token police
Feature activation+0.012
Top resid features:
genocide
Token genocide
Feature activation+0.058
Top resid features:
(
Token (
Feature activation+0.070
Top resid features:
the
Tokenthe
Feature activation+0.038
Top resid features:
word
Token word
Feature activation+2.827
Top resid features:
used
Token used
Feature activation+0.072
Top resid features:
by
Token by
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
Cornell
Token Cornell
Feature activation+0.000
Top resid features:
professor
Token professor
Feature activation+0.000
Top resid features:
Ep
TokenEp
Feature activation+0.016
Top resid features:
hemer
Tokenhemer
Feature activation-0.020
Top resid features:
a
Tokena
Feature activation+0.008
Top resid features:
is
Token is
Feature activation+0.047
Top resid features:
a
Token a
Feature activation+0.048
Top resid features:
word
Token word
Feature activation+2.917
Top resid features:
of
Token of
Feature activation+0.080
Top resid features:
Greek
Token Greek
Feature activation-0.053
Top resid features:
origin
Token origin
Feature activation+0.000
Top resid features:
that
Token that
Feature activation+0.000
Top resid features:
signifies
Token signifies
Feature activation+0.000
Top resid features:
Ep
TokenEp
Feature activation+0.012
Top resid features:
hemer
Tokenhemer
Feature activation-0.005
Top resid features:
a
Tokena
Feature activation+0.003
Top resid features:
is
Token is
Feature activation+0.015
Top resid features:
a
Token a
Feature activation+0.015
Top resid features:
word
Token word
Feature activation+2.558
Top resid features:
of
Token of
Feature activation+0.014
Top resid features:
Greek
Token Greek
Feature activation+0.098
Top resid features:
origin
Token origin
Feature activation+0.040
Top resid features:
that
Token that
Feature activation+0.173
Top resid features:
signifies
Token signifies
Feature activation+0.128
Top resid features:
our
Token our
Feature activation-0.011
Top resid features:
precious
Token precious
Feature activation+0.027
Top resid features:
lives
Token lives
Feature activation+0.008
Top resid features:
.
Token.
Feature activation+0.026
Top resid features:
The
Token The
Feature activation+0.060
Top resid features:
word
Token word
Feature activation+2.703
Top resid features:
bud
Token bud
Feature activation+0.078
Top resid features:
d
Tokend
Feature activation+0.001
Top resid features:
ha
Tokenha
Feature activation+0.060
Top resid features:
means
Token means
Feature activation+0.024
Top resid features:
awakened
Token awakened
Feature activation+0.000
Top resid features:
uses
Token uses
Feature activation+0.084
Top resid features:
a
Token a
Feature activation+0.045
Top resid features:
lot
Token lot
Feature activation+0.064
Top resid features:
of
Token of
Feature activation+0.014
Top resid features:
conditional
Token conditional
Feature activation+0.006
Top resid features:
phr
Token phr
Feature activation+2.860
Top resid features:
asing
Tokenasing
Feature activation-0.110
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ŀ
TokenĿ
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.000
Top resid features:
?
Token?
Feature activation+0.021
Top resid features:
Ċ
TokenĊ
Feature activation+0.042
Top resid features:
Ċ
TokenĊ
Feature activation+0.035
Top resid features:
To
TokenTo
Feature activation+0.010
Top resid features:
re
Token re
Feature activation+0.008
Top resid features:
phrase
Tokenphrase
Feature activation+2.297
Top resid features:
the
Token the
Feature activation+0.065
Top resid features:
question
Token question
Feature activation+0.110
Top resid features:
:
Token:
Feature activation+0.085
Top resid features:
What
Token What
Feature activation+0.145
Top resid features:
would
Token would
Feature activation+0.060
Top resid features:
aching
Tokenaching
Feature activation+0.006
Top resid features:
corrupt
Token corrupt
Feature activation+0.012
Top resid features:
ions
Tokenions
Feature activation+0.004
Top resid features:
of
Token of
Feature activation-0.003
Top resid features:
political
Token political
Feature activation+0.020
Top resid features:
vocabulary
Token vocabulary
Feature activation+1.171
Top resid features:
ske
Token ske
Feature activation+0.014
Top resid features:
wered
Tokenwered
Feature activation+0.026
Top resid features:
by
Token by
Feature activation+0.031
Top resid features:
Orwell
Token Orwell
Feature activation+0.039
Top resid features:
seventy
Token seventy
Feature activation+0.023
Top resid features:
ational
Tokenational
Feature activation+0.017
Top resid features:
hitting
Token hitting
Feature activation+0.065
Top resid features:
is
Token is
Feature activation+0.072
Top resid features:
a
Token a
Feature activation+0.051
Top resid features:
vague
Token vague
Feature activation+0.113
Top resid features:
term
Token term
Feature activation+2.423
Top resid features:
often
Token often
Feature activation+0.124
Top resid features:
used
Token used
Feature activation+0.066
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
laud
Token laud
Feature activation+0.000
Top resid features:
making
Token making
Feature activation+0.000
Top resid features:
rest
Token rest
Feature activation+0.016
Top resid features:
of
Token of
Feature activation+0.007
Top resid features:
society
Token society
Feature activation+0.034
Top resid features:
:
Token:
Feature activation+0.042
Top resid features:
What
Token What
Feature activation+0.043
Top resid features:
word
Token word
Feature activation+2.125
Top resid features:
do
Token do
Feature activation+0.120
Top resid features:
you
Token you
Feature activation+0.071
Top resid features:
use
Token use
Feature activation+0.058
Top resid features:
?
Token?
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.031
Top resid features:
everybody
Token everybody
Feature activation+0.052
Top resid features:
's
Token's
Feature activation+0.027
Top resid features:
watching
Token watching
Feature activation+0.057
Top resid features:
the
Token the
Feature activation+0.033
Top resid features:
words
Token words
Feature activation+2.375
Top resid features:
they
Token they
Feature activation+0.076
Top resid features:
use
Token use
Feature activation+0.034
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
they
Token they
Feature activation+0.000
Top resid features:
should
Token should
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.018
Top resid features:
word
Token word
Feature activation+1.378
Top resid features:
âĢ
Token âĢ
Feature activation+0.005
Top resid features:
ľ
Tokenľ
Feature activation+0.007
Top resid features:
f
Tokenf
Feature activation+0.003
Top resid features:
end
Tokenend
Feature activation+0.008
Top resid features:
âĢ
TokenâĢ
Feature activation+0.002
Top resid features:
peace
Token peace
Feature activation+0.033
Top resid features:
.
Token.
Feature activation+0.023
Top resid features:
âĢ
TokenâĢ
Feature activation+0.006
Top resid features:
Ŀ
TokenĿ
Feature activation+0.093
Top resid features:
Those
Token Those
Feature activation+0.080
Top resid features:
words
Token words
Feature activation+2.673
Top resid features:
were
Token were
Feature activation+0.057
Top resid features:
spoken
Token spoken
Feature activation-0.076
Top resid features:
by
Token by
Feature activation+0.000
Top resid features:
America
Token America
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.03

Head 1: 0.03

Head 2: 0.04

Head 3: 0.01

Head 4: 0.04

Head 5: 0.08

Head 6: 0.56

Head 7: 0.05

Head 8: 0.02

Head 9: 0.02

Head 10: 0.03

Head 11: 0.08

Positive logits

uttered2.35

phrase2.22

abbre2.18

acronym2.14

phrases2.11

pronounce2.09

Liter1.99

utter1.96

Twain1.94

unciation1.90

م1.88

abbrevi1.88

stereotypes1.87

oons1.85

tymology1.84

Latin1.83

English1.83

classics1.82

204391.81

Adult1.77

Negative logits

ascular-2.07

pending-2.00

indemn-1.84

payroll-1.81

feasibility-1.77

servicing-1.72

Frey-1.69

staffing-1.69

jriwal-1.69

etheus-1.69

compensated-1.69

perimeter-1.69

compensation-1.68

pri-1.67

pipeline-1.67

cleanup-1.65

planned-1.64

managing-1.64

shutdown-1.63

confidential-1.63

INTERVAL 2.545 - 2.828
CONTAINS 0.000%

The
TokenThe
Feature activation+0.000
word
Token word
Feature activation+0.000
karma
Token karma
Feature activation+0.000
is
Token is
Feature activation+0.000
also
Token also
Feature activation+0.000
used
Token used
Feature activation+2.726
in
Token in
Feature activation+0.169
different
Token different
Feature activation+0.197
contexts
Token contexts
Feature activation+0.817
.
Token.
Feature activation+0.000
Y
Token Y
Feature activation+0.011
ve
Tokenve
Feature activation+0.000
taken
Token taken
Feature activation+0.000
that
Token that
Feature activation+0.000
phrase
Token phrase
Feature activation+0.128
and
Token and
Feature activation+0.000
applied
Token applied
Feature activation+2.828
it
Token it
Feature activation+0.011
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
sport
Token sport
Feature activation+0.000
and
Token and
Feature activation+0.000

INTERVAL 2.262 - 2.545
CONTAINS 0.001%

,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
what
Token what
Feature activation+0.000
it
Token it
Feature activation+0.000
means
Token means
Feature activation+2.326
in
Token in
Feature activation+0.026
a
Token a
Feature activation+0.000
loose
Token loose
Feature activation+0.000
translation
Token translation
Feature activation+1.249
is
Token is
Feature activation+0.190
most
Token most
Feature activation+0.000
general
Token general
Feature activation+0.061
sense
Token sense
Feature activation+0.380
,
Token,
Feature activation+0.000
karma
Token karma
Feature activation+0.000
refers
Token refers
Feature activation+2.411
to
Token to
Feature activation+0.000
any
Token any
Feature activation+0.000
action
Token action
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
"
Token "
Feature activation+0.597
el
Tokenel
Feature activation+0.000
k
Tokenk
Feature activation+0.000
"
Token"
Feature activation+0.477
is
Token is
Feature activation+0.000
used
Token used
Feature activation+2.423
in
Token in
Feature activation+0.000
North
Token North
Feature activation+0.000
America
Token America
Feature activation+0.000
to
Token to
Feature activation+0.000
refer
Token refer
Feature activation+2.387
a
Token a
Feature activation+0.000
Greek
Token Greek
Feature activation+0.000
word
Token word
Feature activation+0.593
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
means
Token means
Feature activation+2.399
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
love
Tokenlove
Feature activation+0.000
of
Token of
Feature activation+0.000
wisdom
Token wisdom
Feature activation+0.000
used
Token used
Feature activation+2.423
in
Token in
Feature activation+0.000
North
Token North
Feature activation+0.000
America
Token America
Feature activation+0.000
to
Token to
Feature activation+0.000
refer
Token refer
Feature activation+2.387
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
animal
Token animal
Feature activation+0.000
,
Token,
Feature activation+0.000

INTERVAL 1.979 - 2.262
CONTAINS 0.000%

police
Token police
Feature activation+0.000
genocide
Token genocide
Feature activation+0.000
(
Token (
Feature activation+0.000
the
Tokenthe
Feature activation+0.000
word
Token word
Feature activation+0.000
used
Token used
Feature activation+2.127
by
Token by
Feature activation+0.000
a
Token a
Feature activation+0.000
Cornell
Token Cornell
Feature activation+0.000
professor
Token professor
Feature activation+0.000
recently
Token recently
Feature activation+0.000
a
Tokena
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
word
Token word
Feature activation+0.000
of
Token of
Feature activation+0.000
Greek
Token Greek
Feature activation+2.080
origin
Token origin
Feature activation+0.705
that
Token that
Feature activation+0.000
signifies
Token signifies
Feature activation+2.005
something
Token something
Feature activation+0.000
being
Token being
Feature activation+0.000
word
Token word
Feature activation+0.000
of
Token of
Feature activation+0.000
Greek
Token Greek
Feature activation+2.080
origin
Token origin
Feature activation+0.705
that
Token that
Feature activation+0.000
signifies
Token signifies
Feature activation+2.005
something
Token something
Feature activation+0.000
being
Token being
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
short
Token short
Feature activation+0.000

INTERVAL 1.697 - 1.979
CONTAINS 0.001%

The
Token The
Feature activation+0.000
word
Token word
Feature activation+0.000
bud
Token bud
Feature activation+0.000
d
Tokend
Feature activation+0.000
ha
Tokenha
Feature activation+0.000
means
Token means
Feature activation+1.918
awakened
Token awakened
Feature activation+0.000
one
Token one
Feature activation+0.000
.
Token.
Feature activation+0.000
One
Token One
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000
of
Token of
Feature activation+0.000
conditional
Token conditional
Feature activation+0.000
phr
Token phr
Feature activation+0.000
asing
Tokenasing
Feature activation+1.825
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
According
TokenAccording
Feature activation+0.000
:
Token:
Feature activation+0.000
What
Token What
Feature activation+0.000
would
Token would
Feature activation+0.000
be
Token be
Feature activation+0.000
the
Token the
Feature activation+0.000
meaning
Token meaning
Feature activation+1.809
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Christian
Token Christian
Feature activation+0.000
gospel
Token gospel
Feature activation+0.000
if
Token if
Feature activation+0.000
Em
TokenEm
Feature activation+0.000
pt
Tokenpt
Feature activation+0.000
ying
Tokenying
Feature activation+0.000
words
Token words
Feature activation+1.141
of
Token of
Feature activation+0.000
meaning
Token meaning
Feature activation+1.745
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
essential
Token essential
Feature activation+0.000
step
Token step
Feature activation+0.000
on
Token on
Feature activation+0.000

INTERVAL 1.414 - 1.697
CONTAINS 0.001%

;
Token;
Feature activation+0.000
c
Token c
Feature activation+0.189
ihu
Tokenihu
Feature activation+0.000
atl
Tokenatl
Feature activation+0.000
(
Token (
Feature activation+0.172
meaning
Tokenmeaning
Feature activation+1.423
"
Token "
Feature activation+0.907
woman
Tokenwoman
Feature activation+0.000
")
Token")
Feature activation+0.281
and
Token and
Feature activation+0.000
mat
Token mat
Feature activation+0.000
reject
Token reject
Feature activation+0.000
it
Token it
Feature activation+0.000
.
Token.
Feature activation+0.000
In
Token In
Feature activation+0.000
phr
Token phr
Feature activation+0.000
asing
Tokenasing
Feature activation+1.559
its
Token its
Feature activation+0.000
question
Token question
Feature activation+0.000
to
Token to
Feature activation+0.000
1600
Token 1600
Feature activation+0.000
respondents
Token respondents
Feature activation+0.000
just
Token just
Feature activation+0.000
wouldn
Token wouldn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
use
Token use
Feature activation+1.422
in
Token in
Feature activation+0.000
everyday
Token everyday
Feature activation+0.000
talk
Token talk
Feature activation+0.000
and
Token and
Feature activation+0.000
therefore
Token therefore
Feature activation+0.000
's
Token's
Feature activation+0.000
watching
Token watching
Feature activation+0.000
the
Token the
Feature activation+0.000
words
Token words
Feature activation+0.000
they
Token they
Feature activation+0.000
use
Token use
Feature activation+1.622
and
Token and
Feature activation+0.000
they
Token they
Feature activation+0.000
should
Token should
Feature activation+0.000
be
Token be
Feature activation+0.000
,
Token,
Feature activation+0.000
:
Token:
Feature activation+0.000
What
Token What
Feature activation+0.000
word
Token word
Feature activation+0.000
do
Token do
Feature activation+0.000
you
Token you
Feature activation+0.000
use
Token use
Feature activation+1.630
?
Token?
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.075
Some
TokenSome
Feature activation+0.000

INTERVAL 1.131 - 1.414
CONTAINS 0.001%

work
Token work
Feature activation+0.000
in
Token in
Feature activation+0.000
elite
Token elite
Feature activation+0.000
sport
Token sport
Feature activation+0.000
is
Token is
Feature activation+0.000
"
Token "
Feature activation+1.381
bec
Tokenbec
Feature activation+0.000
ome
Tokenome
Feature activation+0.000
comfortable
Token comfortable
Feature activation+0.000
with
Token with
Feature activation+0.000
the
Token the
Feature activation+0.000
flag
Token flag
Feature activation+0.000
operation
Token operation
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
in
Token in
Feature activation+0.000
reference
Token reference
Feature activation+1.239
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
CIA
Token CIA
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
agree
Token agree
Feature activation+0.000
what
Token what
Feature activation+0.000
loving
Token loving
Feature activation+0.000
wisdom
Token wisdom
Feature activation+0.000
actually
Token actually
Feature activation+0.000
means
Token means
Feature activation+1.279
,
Token,
Feature activation+0.000
so
Token so
Feature activation+0.000
they
Token they
Feature activation+0.000
can
Token can
Feature activation+0.000
only
Token only
Feature activation+0.000
and
Token and
Feature activation+0.000
mat
Token mat
Feature activation+0.000
lat
Tokenlat
Feature activation+0.000
l
Tokenl
Feature activation+0.000
(
Token (
Feature activation+0.140
meaning
Tokenmeaning
Feature activation+1.407
"
Token "
Feature activation+0.814
net
Tokennet
Feature activation+0.081
").
Token").
Feature activation+0.000
This
Token This
Feature activation+0.000
"
Token "
Feature activation+0.844
Ċ
TokenĊ
Feature activation+0.000
It
TokenIt
Feature activation+0.000
was
Token was
Feature activation+0.000
in
Token in
Feature activation+0.000
this
Token this
Feature activation+0.000
context
Token context
Feature activation+1.173
that
Token that
Feature activation+0.000
I
Token I
Feature activation+0.000
sought
Token sought
Feature activation+0.000
to
Token to
Feature activation+0.000
translate
Token translate
Feature activation+0.290

INTERVAL 0.848 - 1.131
CONTAINS 0.003%

trans
Token trans
Feature activation+0.000
ph
Tokenph
Feature activation+0.000
obic
Tokenobic
Feature activation+0.000
slurs
Token slurs
Feature activation+0.000
being
Token being
Feature activation+0.000
used
Token used
Feature activation+0.979
on
Token on
Feature activation+0.000
the
Token the
Feature activation+0.000
show
Token show
Feature activation+0.000
.
Token.
Feature activation+0.000
However
Token However
Feature activation+0.000
movement
Token movement
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
common
Token common
Feature activation+0.015
law
Token law
Feature activation+0.000
definition
Token definition
Feature activation+1.022
of
Token of
Feature activation+0.000
l
Token l
Feature activation+0.000
arc
Tokenarc
Feature activation+0.000
eny
Tokeneny
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
still
Token still
Feature activation+0.000
no
Token no
Feature activation+0.000
words
Token words
Feature activation+0.000
were
Token were
Feature activation+0.000
spoken
Token spoken
Feature activation+1.098
.
Token.
Feature activation+0.000
That
Token That
Feature activation+0.000
was
Token was
Feature activation+0.000
the
Token the
Feature activation+0.000
year
Token year
Feature activation+0.000
ism
Tokenism
Feature activation+0.538
that
Token that
Feature activation+0.000
the
Token the
Feature activation+0.000
Beat
Token Beat
Feature activation+0.000
poets
Token poets
Feature activation+0.000
used
Token used
Feature activation+1.129
to
Token to
Feature activation+0.000
use
Token use
Feature activation+1.061
for
Token for
Feature activation+0.348
getting
Token getting
Feature activation+0.041
high
Token high
Feature activation+0.044
.
Token.
Feature activation+0.000
Every
Token Every
Feature activation+0.000
proposed
Token proposed
Feature activation+0.000
presidential
Token presidential
Feature activation+0.000
utter
Token utter
Feature activation+0.000
ance
Tokenance
Feature activation+0.885
is
Token is
Feature activation+0.000
scrub
Token scrub
Feature activation+0.000
bed
Tokenbed
Feature activation+0.000
for
Token for
Feature activation+0.000
accuracy
Token accuracy
Feature activation+0.000

INTERVAL 0.566 - 0.848
CONTAINS 0.007%

wide
Token wide
Feature activation+0.000
range
Token range
Feature activation+0.000
of
Token of
Feature activation+0.000
utter
Token utter
Feature activation+0.000
falsehood
Token falsehood
Feature activation+0.000
s
Tokens
Feature activation+0.674
,
Token,
Feature activation+0.000
all
Token all
Feature activation+0.000
designed
Token designed
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.006
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
looked
Token looked
Feature activation+0.000
up
Token up
Feature activation+0.000
common
Token common
Feature activation+0.062
English
Token English
Feature activation+0.595
words
Token words
Feature activation+0.402
ending
Token ending
Feature activation+0.157
in
Token in
Feature activation+0.000
-
Token -
Feature activation+0.000
o
Tokeno
Feature activation+0.000
economist
Token economist
Feature activation+0.000
Milton
Token Milton
Feature activation+0.000
Friedman
Token Friedman
Feature activation+0.000
uttered
Token uttered
Feature activation+0.000
the
Token the
Feature activation+0.000
words
Token words
Feature activation+0.581
quoted
Token quoted
Feature activation+0.000
above
Token above
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
President
Token President
Feature activation+0.000
the
Tokenthe
Feature activation+0.000
term
Token term
Feature activation+0.000
may
Token may
Feature activation+0.000
also
Token also
Feature activation+0.000
be
Token be
Feature activation+0.000
applied
Token applied
Feature activation+0.756
to
Token to
Feature activation+0.000
many
Token many
Feature activation+0.000
nont
Token nont
Feature activation+0.000
rad
Tokenrad
Feature activation+0.000
itional
Tokenitional
Feature activation+0.000
under
Tokenunder
Feature activation+0.000
stated
Tokenstated
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
to
Token to
Feature activation+0.000
describe
Token describe
Feature activation+0.797
the
Token the
Feature activation+0.000
executives
Token executives
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
misrepresent
Token misrepresent
Feature activation+0.046

INTERVAL 0.283 - 0.566
CONTAINS 0.011%

ge
Token ge
Feature activation+0.163
in
Tokenin
Feature activation+0.000
om
Tokenom
Feature activation+0.000
ai
Tokenai
Feature activation+0.000
),
Token),
Feature activation+0.000
meaning
Token meaning
Feature activation+0.554
"
Token "
Feature activation+0.274
I
TokenI
Feature activation+0.007
form
Token form
Feature activation+0.140
/
Token/
Feature activation+0.000
be
Tokenbe
Feature activation+0.000
arc
Tokenarc
Feature activation+0.000
eny
Tokeneny
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
court
Token court
Feature activation+0.000
said
Token said
Feature activation+0.306
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
opinion
Token opinion
Feature activation+0.000
lot
Token lot
Feature activation+0.000
of
Token of
Feature activation+0.000
reasons
Token reasons
Feature activation+0.000
to
Token to
Feature activation+0.000
like
Token like
Feature activation+0.000
games
Token games
Feature activation+0.388
like
Token like
Feature activation+0.000
Trials
Token Trials
Feature activation+0.000
or
Token or
Feature activation+0.000
Dark
Token Dark
Feature activation+0.000
Souls
Token Souls
Feature activation+0.000
The
TokenThe
Feature activation+0.000
term
Token term
Feature activation+0.000
has
Token has
Feature activation+0.000
come
Token come
Feature activation+0.000
to
Token to
Feature activation+0.000
represent
Token represent
Feature activation+0.338
white
Token white
Feature activation+0.000
Republicans
Token Republicans
Feature activation+0.000
and
Token and
Feature activation+0.000
.
Token .
Feature activation+0.000
.
Token .
Feature activation+0.000
several
Token several
Feature activation+0.000
ideas
Token ideas
Feature activation+0.000
into
Token into
Feature activation+0.000
a
Token a
Feature activation+0.000
single
Token single
Feature activation+0.000
long
Token long
Feature activation+0.484
and
Token and
Feature activation+0.000
winding
Token winding
Feature activation+0.000
sentence
Token sentence
Feature activation+0.558
,
Token,
Feature activation+0.000
can
Token can
Feature activation+0.000

INTERVAL 0.000 - 0.283
CONTAINS 99.976%

stand
Token stand
Feature activation+0.000
to
Token to
Feature activation+0.000
benefit
Token benefit
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
meantime
Token meantime
Feature activation+0.000
.
Token.
Feature activation+0.000
In
Token In
Feature activation+0.000
other
Token other
Feature activation+0.000
words
Token words
Feature activation+0.000
,
Token,
Feature activation+0.000
16
Token 16
Feature activation+0.000
minutes
Token minutes
Feature activation+0.000
after
Token after
Feature activation+0.000
the
Token the
Feature activation+0.000
execution
Token execution
Feature activation+0.000
began
Token began
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
without
Token without
Feature activation+0.000
Lock
Token Lock
Feature activation+0.000
ett
Tokenett
Feature activation+0.000
¦
Token¦
Feature activation+0.000
à¨
Tokenà¨
Feature activation+0.000
°
Token°
Feature activation+0.000
)
Token)
Feature activation+0.000
(
Token (
Feature activation+0.000
H
TokenH
Feature activation+0.000
ind
Tokenind
Feature activation+0.000
i
Tokeni
Feature activation+0.000
:
Token:
Feature activation+0.000
à¤
Token à¤
Feature activation+0.000
¦
Token¦
Feature activation+0.000
entered
Token entered
Feature activation+0.000
the
Token the
Feature activation+0.000
crowd
Token crowd
Feature activation+0.000
where
Token where
Feature activation+0.000
they
Token they
Feature activation+0.000
sexually
Token sexually
Feature activation+0.000
assaulted
Token assaulted
Feature activation+0.000
women
Token women
Feature activation+0.000
and
Token and
Feature activation+0.000
pick
Token pick
Feature activation+0.000
-
Token-
Feature activation+0.000
the
Token the
Feature activation+0.000
age
Token age
Feature activation+0.000
of
Token of
Feature activation+0.000
93
Token 93
Feature activation+0.000
,
Token,
Feature activation+0.000
has
Token has
Feature activation+0.000
passed
Token passed
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Born
TokenBorn
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 1 in H1.6: (feature 790

TOP ACTIVATIONS
MAX = 3.221

project
Token project
Feature activation+0.170
failure
Token failure
Feature activation+0.313
case
Token case
Feature activation+0.362
,
Token,
Feature activation+0.000
was
Token was
Feature activation+0.000
sent
Token sent
Feature activation+3.221
by
Token by
Feature activation+0.328
one
Token one
Feature activation+0.000
programmer
Token programmer
Feature activation+0.599
to
Token to
Feature activation+0.413
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.000
sent
Token sent
Feature activation+3.115
to
Token to
Feature activation+1.119
AZ
Token AZ
Feature activation+0.000
B
TokenB
Feature activation+0.000
Partners
Token Partners
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.138
sent
Token sent
Feature activation+3.013
to
Token to
Feature activation+1.374
Mur
Token Mur
Feature activation+0.000
thy
Tokenthy
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
comment
Token comment
Feature activation+0.000
of
Token of
Feature activation+0.000
email
Token email
Feature activation+0.000
filters
Token filters
Feature activation+0.340
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
sender
Token sender
Feature activation+2.731
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.432
reputation
Token reputation
Feature activation+0.000
causes
Token causes
Feature activation+0.000
B
TokenB
Feature activation+0.000
Partners
Token Partners
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
comment
Token comment
Feature activation+0.000
went
Token went
Feature activation+0.000
unanswered
Token unanswered
Feature activation+2.556
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.138
other
Token other
Feature activation+0.000
principals
Token principals
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
district
Token district
Feature activation+0.000
received
Token received
Feature activation+2.531
email
Token email
Feature activation+0.000
threats
Token threats
Feature activation+0.000
,
Token,
Feature activation+0.000
Mun
Token Mun
Feature activation+0.000
ro
Tokenro
Feature activation+0.000
he
Token he
Feature activation+0.000
addressed
Token addressed
Feature activation+0.000
an
Token an
Feature activation+0.000
email
Token email
Feature activation+0.000
he
Token he
Feature activation+0.000
sent
Token sent
Feature activation+2.487
to
Token to
Feature activation+0.518
Hillary
Token Hillary
Feature activation+0.000
Clinton
Token Clinton
Feature activation+0.000
telling
Token telling
Feature activation+0.000
her
Token her
Feature activation+0.000
several
Token several
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mails
Tokenmails
Feature activation+0.286
I
Token I
Feature activation+0.205
received
Token received
Feature activation+2.462
at
Token at
Feature activation+0.242
the
Token the
Feature activation+0.000
time
Token time
Feature activation+0.000
.
Token.
Feature activation+0.000
But
Token But
Feature activation+0.000
,
Token,
Feature activation+0.000
she
Token she
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
ed
Tokened
Feature activation+2.459
Mori
Token Mori
Feature activation+0.000
arity
Tokenarity
Feature activation+0.000
the
Token the
Feature activation+0.000
36
Token 36
Feature activation+0.000
-
Token-
Feature activation+0.000
conservatives
Token conservatives
Feature activation+0.000
,
Token,
Feature activation+0.000
supporters
Token supporters
Feature activation+0.000
who
Token who
Feature activation+0.000
reliably
Token reliably
Feature activation+0.000
respond
Token respond
Feature activation+2.223
in
Token in
Feature activation+0.000
large
Token large
Feature activation+0.000
numbers
Token numbers
Feature activation+0.000
of
Token of
Feature activation+0.000
small
Token small
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
accompanying
Token accompanying
Feature activation+0.000
email
Token email
Feature activation+0.000
from
Token from
Feature activation+2.138
the
Token the
Feature activation+0.000
Kasich
Token Kasich
Feature activation+0.000
campaign
Token campaign
Feature activation+0.000
emphasized
Token emphasized
Feature activation+0.000
that
Token that
Feature activation+0.000
mistakenly
Token mistakenly
Feature activation+0.000
filtered
Token filtered
Feature activation+0.241
as
Token as
Feature activation+0.000
spam
Token spam
Feature activation+0.000
or
Token or
Feature activation+0.000
sent
Token sent
Feature activation+2.103
into
Token into
Feature activation+0.000
junk
Token junk
Feature activation+0.000
box
Token box
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
domain
Token domain
Feature activation+0.000
it
Token it
Feature activation+0.000
is
Token is
Feature activation+0.000
sent
Token sent
Feature activation+2.087
from
Token from
Feature activation+0.893
.
Token.
Feature activation+0.000
Even
Token Even
Feature activation+0.000
from
Token from
Feature activation+1.271
a
Token a
Feature activation+0.000
that
Token that
Feature activation+0.000
were
Token were
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
ed
Tokened
Feature activation+2.059
daily
Token daily
Feature activation+0.000
to
Token to
Feature activation+0.353
a
Token a
Feature activation+0.000
national
Token national
Feature activation+0.000
network
Token network
Feature activation+0.000
at
Token at
Feature activation+0.550
This
Token This
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.340
address
Token address
Feature activation+2.052
is
Token is
Feature activation+0.023
being
Token being
Feature activation+0.000
protected
Token protected
Feature activation+0.293
from
Token from
Feature activation+0.968
sp
Token sp
Feature activation+0.000
wd
Tokenwd
Feature activation+0.000
:
Token:
Feature activation+0.000
Jeb
Token Jeb
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
From
TokenFrom
Feature activation+1.930
:
Token:
Feature activation+0.000
jp
Tokenjp
Feature activation+0.000
66
Token66
Feature activation+0.000
@
Token@
Feature activation+1.416
hillary
Tokenhillary
Feature activation+0.000
The
Token The
Feature activation+0.000
following
Token following
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
email
Token email
Feature activation+0.000
from
Token from
Feature activation+1.905
Terry
Token Terry
Feature activation+0.054
Kramer
Token Kramer
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
US
Token US
Feature activation+0.000
!
Token!
Feature activation+0.000
Sorry
Token Sorry
Feature activation+0.000
for
Token for
Feature activation+0.000
my
Token my
Feature activation+0.000
poor
Token poor
Feature activation+0.000
response
Token response
Feature activation+1.844
.
Token.
Feature activation+0.000
Would
Token Would
Feature activation+0.000
have
Token have
Feature activation+0.000
love
Token love
Feature activation+0.000
to
Token to
Feature activation+0.175
back
Token back
Feature activation+0.136
for
Token for
Feature activation+0.000
him
Token him
Feature activation+0.000
and
Token and
Feature activation+0.000
he
Token he
Feature activation+0.000
replied
Token replied
Feature activation+1.802
,
Token,
Feature activation+0.000
"
Token "
Feature activation+0.000
If
TokenIf
Feature activation+0.225
can
Token can
Feature activation+0.000
coach
Token coach
Feature activation+0.000
K
Token K
Feature activation+0.000
OT
TokenOT
Feature activation+0.000
C
TokenC
Feature activation+0.000
and
Token and
Feature activation+0.000
he
Token he
Feature activation+0.000
replied
Token replied
Feature activation+1.695
"
Token "
Feature activation+0.000
No
TokenNo
Feature activation+0.000
,
Token,
Feature activation+0.000
because
Token because
Feature activation+0.000
I
Token I
Feature activation+0.000

Top DFA by src position
MAX = 4.702

and
Token and
Feature activation-0.005
Top resid features:
letters
Token letters
Feature activation+0.189
Top resid features:
,
Token,
Feature activation-0.005
Top resid features:
e
Token e
Feature activation+0.030
Top resid features:
-
Token-
Feature activation-0.001
Top resid features:
mails
Tokenmails
Feature activation+1.848
Top resid features:
contain
Token contain
Feature activation+0.008
Top resid features:
un
Token un
Feature activation+0.003
Top resid features:
v
Tokenv
Feature activation+0.001
Top resid features:
arn
Tokenarn
Feature activation+0.009
Top resid features:
ished
Tokenished
Feature activation+0.005
Top resid features:
above
Token above
Feature activation-0.005
Top resid features:
.
Token.
Feature activation+0.034
Top resid features:
Ċ
TokenĊ
Feature activation+0.026
Top resid features:
Ċ
TokenĊ
Feature activation+0.028
Top resid features:
An
TokenAn
Feature activation+0.111
Top resid features:
email
Token email
Feature activation+4.702
Top resid features:
sent
Token sent
Feature activation-0.368
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
AZ
Token AZ
Feature activation+0.000
Top resid features:
B
TokenB
Feature activation+0.000
Top resid features:
Partners
Token Partners
Feature activation+0.000
Top resid features:
above
Token above
Feature activation-0.001
Top resid features:
.
Token.
Feature activation-0.015
Top resid features:
Ċ
TokenĊ
Feature activation-0.015
Top resid features:
Ċ
TokenĊ
Feature activation-0.014
Top resid features:
An
TokenAn
Feature activation+0.008
Top resid features:
email
Token email
Feature activation+2.347
Top resid features:
sent
Token sent
Feature activation-0.133
Top resid features:
to
Token to
Feature activation-0.005
Top resid features:
AZ
Token AZ
Feature activation-0.001
Top resid features:
B
TokenB
Feature activation-0.001
Top resid features:
Partners
Token Partners
Feature activation-0.007
Top resid features:
deliver
Token deliver
Feature activation-0.001
Top resid features:
ability
Tokenability
Feature activation+0.005
Top resid features:
.
Token.
Feature activation-0.009
Top resid features:
Ċ
TokenĊ
Feature activation-0.008
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
Email
TokenEmail
Feature activation+1.173
Top resid features:
Fil
Token Fil
Feature activation+0.003
Top resid features:
tering
Tokentering
Feature activation+0.012
Top resid features:
by
Token by
Feature activation+0.003
Top resid features:
ISPs
Token ISPs
Feature activation+0.005
Top resid features:
and
Token and
Feature activation-0.003
Top resid features:
above
Token above
Feature activation-0.003
Top resid features:
.
Token.
Feature activation-0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.005
Top resid features:
An
TokenAn
Feature activation+0.015
Top resid features:
email
Token email
Feature activation+3.825
Top resid features:
sent
Token sent
Feature activation+0.150
Top resid features:
to
Token to
Feature activation-0.015
Top resid features:
AZ
Token AZ
Feature activation+0.005
Top resid features:
B
TokenB
Feature activation+0.013
Top resid features:
Partners
Token Partners
Feature activation+0.019
Top resid features:
,"
Token,"
Feature activation+0.002
Top resid features:
Clark
Token Clark
Feature activation-0.001
Top resid features:
said
Token said
Feature activation+0.012
Top resid features:
.
Token.
Feature activation-0.018
Top resid features:
The
TokenThe
Feature activation-0.001
Top resid features:
email
Token email
Feature activation+1.193
Top resid features:
mentioned
Token mentioned
Feature activation+0.000
Top resid features:
specific
Token specific
Feature activation+0.015
Top resid features:
weapons
Token weapons
Feature activation+0.044
Top resid features:
and
Token and
Feature activation+0.018
Top resid features:
locations
Token locations
Feature activation+0.025
Top resid features:
Thursday
Token Thursday
Feature activation+0.063
Top resid features:
,
Token,
Feature activation-0.024
Top resid features:
he
Token he
Feature activation+0.018
Top resid features:
addressed
Token addressed
Feature activation+0.063
Top resid features:
an
Token an
Feature activation+0.073
Top resid features:
email
Token email
Feature activation+3.875
Top resid features:
he
Token he
Feature activation+0.010
Top resid features:
sent
Token sent
Feature activation-0.408
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
Hillary
Token Hillary
Feature activation+0.000
Top resid features:
Clinton
Token Clinton
Feature activation+0.000
Top resid features:
according
Token according
Feature activation+0.064
Top resid features:
to
Token to
Feature activation-0.004
Top resid features:
several
Token several
Feature activation+0.041
Top resid features:
e
Token e
Feature activation+0.058
Top resid features:
-
Token-
Feature activation+0.015
Top resid features:
mails
Tokenmails
Feature activation+3.507
Top resid features:
I
Token I
Feature activation+0.044
Top resid features:
received
Token received
Feature activation-0.225
Top resid features:
at
Token at
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
time
Token time
Feature activation+0.000
Top resid features:
case
Token case
Feature activation+0.006
Top resid features:
,
Token,
Feature activation+0.061
Top resid features:
she
Token she
Feature activation+0.013
Top resid features:
e
Token e
Feature activation+0.097
Top resid features:
-
Token-
Feature activation+0.075
Top resid features:
mail
Tokenmail
Feature activation+3.552
Top resid features:
ed
Tokened
Feature activation-0.249
Top resid features:
Mori
Token Mori
Feature activation+0.000
Top resid features:
arity
Tokenarity
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
36
Token 36
Feature activation+0.000
Top resid features:
pitches
Token pitches
Feature activation+0.026
Top resid features:
that
Token that
Feature activation+0.018
Top resid features:
were
Token were
Feature activation+0.062
Top resid features:
e
Token e
Feature activation+0.072
Top resid features:
-
Token-
Feature activation-0.007
Top resid features:
mail
Tokenmail
Feature activation+3.203
Top resid features:
ed
Tokened
Feature activation+0.021
Top resid features:
daily
Token daily
Feature activation+0.009
Top resid features:
to
Token to
Feature activation-0.003
Top resid features:
a
Token a
Feature activation+0.036
Top resid features:
national
Token national
Feature activation-0.019
Top resid features:
."
Token."
Feature activation+0.038
Top resid features:
Ċ
TokenĊ
Feature activation+0.007
Top resid features:
Ċ
TokenĊ
Feature activation+0.011
Top resid features:
An
TokenAn
Feature activation+0.066
Top resid features:
accompanying
Token accompanying
Feature activation+0.010
Top resid features:
email
Token email
Feature activation+3.175
Top resid features:
from
Token from
Feature activation-0.001
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
Kasich
Token Kasich
Feature activation+0.000
Top resid features:
campaign
Token campaign
Feature activation+0.000
Top resid features:
emphasized
Token emphasized
Feature activation+0.000
Top resid features:
But
Token But
Feature activation+0.002
Top resid features:
sometimes
Token sometimes
Feature activation+0.016
Top resid features:
,
Token,
Feature activation-0.024
Top resid features:
legitimate
Token legitimate
Feature activation-0.009
Top resid features:
email
Token email
Feature activation+0.757
Top resid features:
messages
Token messages
Feature activation+1.303
Top resid features:
are
Token are
Feature activation+0.068
Top resid features:
mistakenly
Token mistakenly
Feature activation+0.035
Top resid features:
filtered
Token filtered
Feature activation+0.081
Top resid features:
as
Token as
Feature activation+0.043
Top resid features:
spam
Token spam
Feature activation+0.056
Top resid features:
an
Token an
Feature activation+0.048
Top resid features:
IP
Token IP
Feature activation+0.066
Top resid features:
address
Token address
Feature activation+0.049
Top resid features:
is
Token is
Feature activation+0.033
Top resid features:
sending
Token sending
Feature activation-0.051
Top resid features:
emails
Token emails
Feature activation+2.329
Top resid features:
which
Token which
Feature activation+0.013
Top resid features:
are
Token are
Feature activation+0.030
Top resid features:
perceived
Token perceived
Feature activation-0.016
Top resid features:
to
Token to
Feature activation-0.010
Top resid features:
be
Token be
Feature activation+0.029
Top resid features:
pitches
Token pitches
Feature activation+0.092
Top resid features:
that
Token that
Feature activation+0.017
Top resid features:
were
Token were
Feature activation+0.092
Top resid features:
e
Token e
Feature activation+0.091
Top resid features:
-
Token-
Feature activation+0.052
Top resid features:
mail
Tokenmail
Feature activation+3.104
Top resid features:
ed
Tokened
Feature activation-0.339
Top resid features:
daily
Token daily
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
national
Token national
Feature activation+0.000
Top resid features:
email
Token email
Feature activation+1.078
Top resid features:
at
Token at
Feature activation+0.052
Top resid features:
This
Token This
Feature activation+0.141
Top resid features:
e
Token e
Feature activation+0.149
Top resid features:
-
Token-
Feature activation+0.056
Top resid features:
mail
Tokenmail
Feature activation+1.765
Top resid features:
address
Token address
Feature activation-0.520
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
being
Token being
Feature activation+0.000
Top resid features:
protected
Token protected
Feature activation+0.000
Top resid features:
from
Token from
Feature activation+0.000
Top resid features:
human
Token human
Feature activation-0.004
Top resid features:
reality
Token reality
Feature activation-0.003
Top resid features:
.
Token.
Feature activation+0.004
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.035
Top resid features:
This
TokenThis
Feature activation+0.023
Top resid features:
email
Token email
Feature activation+3.057
Top resid features:
has
Token has
Feature activation-0.022
Top resid features:
also
Token also
Feature activation+0.024
Top resid features:
been
Token been
Feature activation-0.009
Top resid features:
verified
Token verified
Feature activation+0.011
Top resid features:
by
Token by
Feature activation-0.022
Top resid features:
.
Token.
Feature activation-0.008
Top resid features:
The
Token The
Feature activation+0.029
Top resid features:
following
Token following
Feature activation+0.050
Top resid features:
is
Token is
Feature activation+0.089
Top resid features:
an
Token an
Feature activation+0.073
Top resid features:
email
Token email
Feature activation+2.792
Top resid features:
from
Token from
Feature activation-0.011
Top resid features:
Terry
Token Terry
Feature activation+0.000
Top resid features:
Kramer
Token Kramer
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
But
TokenBut
Feature activation+0.019
Top resid features:
then
Token then
Feature activation+0.026
Top resid features:
I
Token I
Feature activation+0.009
Top resid features:
got
Token got
Feature activation+0.021
Top resid features:
an
Token an
Feature activation+0.074
Top resid features:
email
Token email
Feature activation+2.052
Top resid features:
two
Token two
Feature activation+0.051
Top resid features:
days
Token days
Feature activation-0.005
Top resid features:
after
Token after
Feature activation+0.009
Top resid features:
Sign
Token Sign
Feature activation+0.024
Top resid features:
ing
Tokening
Feature activation+0.035
Top resid features:
was
Token was
Feature activation+0.028
Top resid features:
able
Token able
Feature activation+0.008
Top resid features:
was
Token was
Feature activation+0.026
Top resid features:
able
Token able
Feature activation+0.007
Top resid features:
to
Token to
Feature activation+0.001
Top resid features:
email
Token email
Feature activation+1.446
Top resid features:
back
Token back
Feature activation+0.010
Top resid features:
the
Token the
Feature activation+0.023
Top resid features:
contract
Token contract
Feature activation+0.033
Top resid features:
to
Token to
Feature activation-0.001
Top resid features:
K
Token K
Feature activation+0.006
Top resid features:
was
Token was
Feature activation+0.065
Top resid features:
able
Token able
Feature activation+0.019
Top resid features:
was
Token was
Feature activation+0.059
Top resid features:
able
Token able
Feature activation+0.015
Top resid features:
to
Token to
Feature activation-0.020
Top resid features:
email
Token email
Feature activation+2.848
Top resid features:
back
Token back
Feature activation+0.035
Top resid features:
the
Token the
Feature activation+0.031
Top resid features:
contract
Token contract
Feature activation+0.023
Top resid features:
to
Token to
Feature activation-0.040
Top resid features:
K
Token K
Feature activation-0.016
Top resid features:

Decoder Weights Distribution

Head 0: 0.04

Head 1: 0.03

Head 2: 0.03

Head 3: 0.01

Head 4: 0.04

Head 5: 0.08

Head 6: 0.55

Head 7: 0.05

Head 8: 0.02

Head 9: 0.02

Head 10: 0.03

Head 11: 0.08

Positive logits

inbox2.56

Emails2.46

replies2.29

CLASSIFIED2.28

reply2.26

emails2.26

invitations2.24

mail2.20

Subscribe2.18

correspondence2.16

encrypted2.13

fax2.13

forwarded2.12

email2.10

Email2.05

Mail2.04

subscriptions2.00

answ2.00

UNCLASSIFIED1.95

notifications1.94

Negative logits

Rally-1.87

rally-1.85

Kru-1.80

torch-1.79

uay-1.73

sto-1.70

tanks-1.69

measure-1.69

choke-1.68

displacement-1.67

Pit-1.65

Pike-1.63

run-1.62

unequal-1.61

oppressed-1.60

rac-1.59

bulldo-1.58

deform-1.56

BIP-1.56

tank-1.56

INTERVAL 2.899 - 3.221
CONTAINS 0.000%

.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.000
sent
Token sent
Feature activation+3.115
to
Token to
Feature activation+1.119
AZ
Token AZ
Feature activation+0.000
B
TokenB
Feature activation+0.000
Partners
Token Partners
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.138
sent
Token sent
Feature activation+3.013
to
Token to
Feature activation+1.374
Mur
Token Mur
Feature activation+0.000
thy
Tokenthy
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
comment
Token comment
Feature activation+0.000
project
Token project
Feature activation+0.170
failure
Token failure
Feature activation+0.313
case
Token case
Feature activation+0.362
,
Token,
Feature activation+0.000
was
Token was
Feature activation+0.000
sent
Token sent
Feature activation+3.221
by
Token by
Feature activation+0.328
one
Token one
Feature activation+0.000
programmer
Token programmer
Feature activation+0.599
to
Token to
Feature activation+0.413
the
Token the
Feature activation+0.000

INTERVAL 2.577 - 2.899
CONTAINS 0.000%

of
Token of
Feature activation+0.000
email
Token email
Feature activation+0.000
filters
Token filters
Feature activation+0.340
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
sender
Token sender
Feature activation+2.731
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.432
reputation
Token reputation
Feature activation+0.000
causes
Token causes
Feature activation+0.000

INTERVAL 2.255 - 2.577
CONTAINS 0.001%

,
Token,
Feature activation+0.000
she
Token she
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
ed
Tokened
Feature activation+2.459
Mori
Token Mori
Feature activation+0.000
arity
Tokenarity
Feature activation+0.000
the
Token the
Feature activation+0.000
36
Token 36
Feature activation+0.000
-
Token-
Feature activation+0.000
B
TokenB
Feature activation+0.000
Partners
Token Partners
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
comment
Token comment
Feature activation+0.000
went
Token went
Feature activation+0.000
unanswered
Token unanswered
Feature activation+2.556
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.138
he
Token he
Feature activation+0.000
addressed
Token addressed
Feature activation+0.000
an
Token an
Feature activation+0.000
email
Token email
Feature activation+0.000
he
Token he
Feature activation+0.000
sent
Token sent
Feature activation+2.487
to
Token to
Feature activation+0.518
Hillary
Token Hillary
Feature activation+0.000
Clinton
Token Clinton
Feature activation+0.000
telling
Token telling
Feature activation+0.000
her
Token her
Feature activation+0.000
several
Token several
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mails
Tokenmails
Feature activation+0.286
I
Token I
Feature activation+0.205
received
Token received
Feature activation+2.462
at
Token at
Feature activation+0.242
the
Token the
Feature activation+0.000
time
Token time
Feature activation+0.000
.
Token.
Feature activation+0.000
But
Token But
Feature activation+0.000
other
Token other
Feature activation+0.000
principals
Token principals
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
district
Token district
Feature activation+0.000
received
Token received
Feature activation+2.531
email
Token email
Feature activation+0.000
threats
Token threats
Feature activation+0.000
,
Token,
Feature activation+0.000
Mun
Token Mun
Feature activation+0.000
ro
Tokenro
Feature activation+0.000

INTERVAL 1.932 - 2.255
CONTAINS 0.001%

conservatives
Token conservatives
Feature activation+0.000
,
Token,
Feature activation+0.000
supporters
Token supporters
Feature activation+0.000
who
Token who
Feature activation+0.000
reliably
Token reliably
Feature activation+0.000
respond
Token respond
Feature activation+2.223
in
Token in
Feature activation+0.000
large
Token large
Feature activation+0.000
numbers
Token numbers
Feature activation+0.000
of
Token of
Feature activation+0.000
small
Token small
Feature activation+0.000
mistakenly
Token mistakenly
Feature activation+0.000
filtered
Token filtered
Feature activation+0.241
as
Token as
Feature activation+0.000
spam
Token spam
Feature activation+0.000
or
Token or
Feature activation+0.000
sent
Token sent
Feature activation+2.103
into
Token into
Feature activation+0.000
junk
Token junk
Feature activation+0.000
box
Token box
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
accompanying
Token accompanying
Feature activation+0.000
email
Token email
Feature activation+0.000
from
Token from
Feature activation+2.138
the
Token the
Feature activation+0.000
Kasich
Token Kasich
Feature activation+0.000
campaign
Token campaign
Feature activation+0.000
emphasized
Token emphasized
Feature activation+0.000
that
Token that
Feature activation+0.000
that
Token that
Feature activation+0.000
were
Token were
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
ed
Tokened
Feature activation+2.059
daily
Token daily
Feature activation+0.000
to
Token to
Feature activation+0.353
a
Token a
Feature activation+0.000
national
Token national
Feature activation+0.000
network
Token network
Feature activation+0.000
at
Token at
Feature activation+0.550
This
Token This
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.340
address
Token address
Feature activation+2.052
is
Token is
Feature activation+0.023
being
Token being
Feature activation+0.000
protected
Token protected
Feature activation+0.293
from
Token from
Feature activation+0.968
sp
Token sp
Feature activation+0.000

INTERVAL 1.610 - 1.932
CONTAINS 0.001%

wd
Tokenwd
Feature activation+0.000
:
Token:
Feature activation+0.000
Jeb
Token Jeb
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
From
TokenFrom
Feature activation+1.930
:
Token:
Feature activation+0.000
jp
Tokenjp
Feature activation+0.000
66
Token66
Feature activation+0.000
@
Token@
Feature activation+1.416
hillary
Tokenhillary
Feature activation+0.000
back
Token back
Feature activation+0.136
for
Token for
Feature activation+0.000
him
Token him
Feature activation+0.000
and
Token and
Feature activation+0.000
he
Token he
Feature activation+0.000
replied
Token replied
Feature activation+1.802
,
Token,
Feature activation+0.000
"
Token "
Feature activation+0.000
If
TokenIf
Feature activation+0.225
can
Token can
Feature activation+0.000
coach
Token coach
Feature activation+0.000
K
Token K
Feature activation+0.000
OT
TokenOT
Feature activation+0.000
C
TokenC
Feature activation+0.000
and
Token and
Feature activation+0.000
he
Token he
Feature activation+0.000
replied
Token replied
Feature activation+1.695
"
Token "
Feature activation+0.000
No
TokenNo
Feature activation+0.000
,
Token,
Feature activation+0.000
because
Token because
Feature activation+0.000
I
Token I
Feature activation+0.000
an
Token an
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
or
Token or
Feature activation+0.000
contact
Token contact
Feature activation+1.665
him
Token him
Feature activation+0.000
via
Token via
Feature activation+0.122
Twitter
Token Twitter
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
!
Token!
Feature activation+0.000
Sorry
Token Sorry
Feature activation+0.000
for
Token for
Feature activation+0.000
my
Token my
Feature activation+0.000
poor
Token poor
Feature activation+0.000
response
Token response
Feature activation+1.844
.
Token.
Feature activation+0.000
Would
Token Would
Feature activation+0.000
have
Token have
Feature activation+0.000
love
Token love
Feature activation+0.000
to
Token to
Feature activation+0.175

INTERVAL 1.288 - 1.610
CONTAINS 0.001%

and
Token and
Feature activation+0.000
scrub
Token scrub
Feature activation+0.000
bed
Tokenbed
Feature activation+0.000
of
Token of
Feature activation+0.000
any
Token any
Feature activation+0.000
sensitive
Token sensitive
Feature activation+1.389
data
Token data
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
em
Token em
Feature activation+0.000
ma
Tokenma
Feature activation+0.000
.
Token.
Feature activation+0.000
kn
Tokenkn
Feature activation+0.000
ights
Tokenights
Feature activation+0.000
@
Token@
Feature activation+1.531
arch
Tokenarch
Feature activation+0.000
ant
Tokenant
Feature activation+0.000
.
Token.
Feature activation+0.000
co
Tokenco
Feature activation+0.000
.
Token.
Feature activation+0.000
you
Token you
Feature activation+0.000
create
Token create
Feature activation+0.000
unique
Token unique
Feature activation+0.000
,
Token,
Feature activation+0.000
disposable
Token disposable
Feature activation+0.000
inbox
Token inbox
Feature activation+1.479
es
Tokenes
Feature activation+0.628
which
Token which
Feature activation+0.000
delete
Token delete
Feature activation+0.449
themselves
Token themselves
Feature activation+0.000
after
Token after
Feature activation+0.029
;
Token;
Feature activation+0.000
it
Token it
Feature activation+0.000
was
Token was
Feature activation+0.000
an
Token an
Feature activation+0.000
AOL
Token AOL
Feature activation+0.019
address
Token address
Feature activation+1.433
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
GAME
TokenGAME
Feature activation+0.000
ON
Token ON
Feature activation+0.171
ts
Tokents
Feature activation+0.000
?
Token?
Feature activation+0.000
Email
Token Email
Feature activation+0.000
game
Token game
Feature activation+0.000
central
Tokencentral
Feature activation+0.000
@
Token@
Feature activation+1.524
uk
Tokenuk
Feature activation+0.000
met
Tokenmet
Feature activation+0.000
ro
Tokenro
Feature activation+0.000
.
Token.
Feature activation+0.000
co
Tokenco
Feature activation+0.000

INTERVAL 0.966 - 1.288
CONTAINS 0.003%

as
Token as
Feature activation+0.000
your
Token your
Feature activation+0.000
phone
Token phone
Feature activation+0.000
number
Token number
Feature activation+0.000
,
Token,
Feature activation+0.000
postal
Token postal
Feature activation+1.052
address
Token address
Feature activation+0.000
,
Token,
Feature activation+0.000
your
Token your
Feature activation+0.000
Facebook
Token Facebook
Feature activation+0.000
identity
Token identity
Feature activation+0.000
Email
TokenEmail
Feature activation+0.000
:
Token:
Feature activation+0.000
dr
Token dr
Feature activation+0.000
aths
Tokenaths
Feature activation+0.000
ack
Tokenack
Feature activation+0.000
@
Token@
Feature activation+1.115
ci
Tokenci
Feature activation+0.000
.
Token.
Feature activation+0.000
l
Tokenl
Feature activation+0.000
ud
Tokenud
Feature activation+0.000
ington
Tokenington
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
An
TokenAn
Feature activation+0.000
email
Token email
Feature activation+0.000
sent
Token sent
Feature activation+3.115
to
Token to
Feature activation+1.119
AZ
Token AZ
Feature activation+0.000
B
TokenB
Feature activation+0.000
Partners
Token Partners
Feature activation+0.000
seeking
Token seeking
Feature activation+0.000
comment
Token comment
Feature activation+0.000
phone
Token phone
Feature activation+0.000
and
Token and
Feature activation+0.000
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mail
Tokenmail
Feature activation+0.000
inbox
Token inbox
Feature activation+1.205
went
Token went
Feature activation+0.000
absolutely
Token absolutely
Feature activation+0.000
b
Token b
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
erk
Tokenerk
Feature activation+0.000
:
Token:
Feature activation+0.000
v
Token v
Feature activation+0.000
t
Tokent
Feature activation+0.000
af
Tokenaf
Feature activation+0.000
ur
Tokenur
Feature activation+0.000
@
Token@
Feature activation+1.212
sf
Tokensf
Feature activation+0.000
chron
Tokenchron
Feature activation+0.000
icle
Tokenicle
Feature activation+0.000
.
Token.
Feature activation+0.000
com
Tokencom
Feature activation+0.000

INTERVAL 0.644 - 0.966
CONTAINS 0.004%

the
Token the
Feature activation+0.000
address
Token address
Feature activation+0.000
,
Token,
Feature activation+0.000
you
Token you
Feature activation+0.007
can
Token can
Feature activation+0.000
send
Token send
Feature activation+0.900
data
Token data
Feature activation+0.000
to
Token to
Feature activation+0.154
it
Token it
Feature activation+0.000
.
Token.
Feature activation+0.000
Unlike
Token Unlike
Feature activation+0.000
or
Token or
Feature activation+0.000
organization
Token organization
Feature activation+0.601
.
Token.
Feature activation+0.000
One
Token One
Feature activation+0.000
of
Token of
Feature activation+0.000
my
Token my
Feature activation+0.691
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
time
Tokentime
Feature activation+0.012
favorite
Token favorite
Feature activation+0.000
e
Token e
Feature activation+0.368
emailed
Token emailed
Feature activation+0.000
a
Token a
Feature activation+0.000
letter
Token letter
Feature activation+0.380
of
Token of
Feature activation+0.000
apology
Token apology
Feature activation+0.000
to
Token to
Feature activation+0.680
Student
Token Student
Feature activation+0.000
Government
Token Government
Feature activation+0.000
Association
Token Association
Feature activation+0.000
President
Token President
Feature activation+0.000
John
Token John
Feature activation+0.000
dialect
Token dialect
Feature activation+0.000
s
Tokens
Feature activation+0.000
,
Token,
Feature activation+0.000
according
Token according
Feature activation+0.000
to
Token to
Feature activation+0.000
Die
Token Die
Feature activation+0.797
W
Token W
Feature activation+0.000
elt
Tokenelt
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
I
Token I
Feature activation+0.112
thought
Token thought
Feature activation+0.000
,
Token,
Feature activation+0.000
well
Token well
Feature activation+0.000
,
Token,
Feature activation+0.000
at
Token at
Feature activation+0.730
least
Token least
Feature activation+0.000
these
Token these
Feature activation+0.000
pictures
Token pictures
Feature activation+0.000
capture
Token capture
Feature activation+0.000
an
Token an
Feature activation+0.000

INTERVAL 0.322 - 0.644
CONTAINS 0.014%

Abedin
Token Abedin
Feature activation+0.000
>>>
Token >>>
Feature activation+0.000
>>
Token>>
Feature activation+0.000
Subject
Token Subject
Feature activation+0.700
:
Token:
Feature activation+0.000
Re
Token Re
Feature activation+0.441
:
Token:
Feature activation+0.000
Traff
Token Traff
Feature activation+0.000
icking
Tokenicking
Feature activation+0.000
//
Token//
Feature activation+0.000
L
TokenL
Feature activation+0.000
had
Token had
Feature activation+0.000
sent
Token sent
Feature activation+1.279
e
Token e
Feature activation+0.000
-
Token-
Feature activation+0.000
mails
Tokenmails
Feature activation+0.000
to
Token to
Feature activation+0.334
a
Token a
Feature activation+0.000
televised
Token televised
Feature activation+0.000
debate
Token debate
Feature activation+0.000
backing
Token backing
Feature activation+0.000
a
Token a
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Brief
Token Brief
Feature activation+0.000
Newsletter
Token Newsletter
Feature activation+0.000
Sign
Token Sign
Feature activation+0.353
up
Token up
Feature activation+0.000
to
Token to
Feature activation+0.000
receive
Token receive
Feature activation+0.268
the
Token the
Feature activation+0.000
top
Token top
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Brief
Token Brief
Feature activation+0.000
Newsletter
Token Newsletter
Feature activation+0.000
Sign
Token Sign
Feature activation+0.425
up
Token up
Feature activation+0.000
to
Token to
Feature activation+0.105
receive
Token receive
Feature activation+0.383
the
Token the
Feature activation+0.000
top
Token top
Feature activation+0.000
or
Token or
Feature activation+0.000
"
Token "
Feature activation+0.000
cap
Tokencap
Feature activation+0.000
abilities
Tokenabilities
Feature activation+0.000
"
Token"
Feature activation+0.000
to
Token to
Feature activation+0.327
abuse
Token abuse
Feature activation+0.000
the
Token the
Feature activation+0.000
computer
Token computer
Feature activation+0.000
science
Token science
Feature activation+0.000
ling
Token ling
Feature activation+0.000

INTERVAL 0.000 - 0.322
CONTAINS 99.974%

developed
Token developed
Feature activation+0.000
and
Token and
Feature activation+0.000
launched
Token launched
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
mere
Token mere
Feature activation+0.000
15
Token 15
Feature activation+0.000
months
Token months
Feature activation+0.000
,
Token,
Feature activation+0.000
making
Token making
Feature activation+0.000
today
Token today
Feature activation+0.000
it
Token it
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
unacceptable
Token unacceptable
Feature activation+0.000
,
Token,
Feature activation+0.000
including
Token including
Feature activation+0.000
myself
Token myself
Feature activation+0.000
.
Token.
Feature activation+0.000
Those
Token Those
Feature activation+0.000
kinds
Token kinds
Feature activation+0.000
co
Tokenco
Feature activation+0.000
:
Token:
Feature activation+0.000
Probably
Token Probably
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
because
Token because
Feature activation+0.000
speed
Token speed
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
are
Token are
Feature activation+0.000
an
Token an
Feature activation+0.000
A
Token A
Feature activation+0.000
did
Token did
Feature activation+0.000
women
Token women
Feature activation+0.000
(
Token (
Feature activation+0.000
49
Token49
Feature activation+0.000
percent
Token percent
Feature activation+0.000
).
Token).
Feature activation+0.000
While
Token While
Feature activation+0.000
support
Token support
Feature activation+0.000
rates
Token rates
Feature activation+0.000
for
Token for
Feature activation+0.000
people
Token people
Feature activation+0.000
VR
Token VR
Feature activation+0.000
aims
Token aims
Feature activation+0.000
to
Token to
Feature activation+0.000
challenge
Token challenge
Feature activation+0.000
the
Token the
Feature activation+0.000
first
Token first
Feature activation+0.000
-
Token-
Feature activation+0.000
person
Tokenperson
Feature activation+0.000
shooter
Token shooter
Feature activation+0.000
(
Token (
Feature activation+0.000
F
TokenF
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 2 in H1.6: (feature 5425

TOP ACTIVATIONS
MAX = 2.344

service
Token service
Feature activation+0.000
while
Token while
Feature activation+0.000
turning
Token turning
Feature activation+0.000
a
Token a
Feature activation+0.000
blind
Token blind
Feature activation+0.000
eye
Token eye
Feature activation+2.344
--
Token --
Feature activation+0.000
or
Token or
Feature activation+0.000
maybe
Token maybe
Feature activation+0.000
that
Token that
Feature activation+0.000
's
Token's
Feature activation+0.000
,
Token,
Feature activation+0.000
nuance
Token nuance
Feature activation+0.000
and
Token and
Feature activation+0.000
turning
Token turning
Feature activation+0.000
blind
Token blind
Feature activation+0.000
eyes
Token eyes
Feature activation+2.109
.
Token.
Feature activation+0.000
So
Token So
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000
your
Token your
Feature activation+0.000
computer
Token computer
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
blind
Tokenblind
Feature activation+0.000
ly
Tokenly
Feature activation+1.353
,
Token,
Feature activation+0.000
unconsciously
Token unconsciously
Feature activation+0.222
,
Token,
Feature activation+0.000
naturally
Token naturally
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
first
Token first
Feature activation+0.000
issue
Token issue
Feature activation+0.000
,
Token,
Feature activation+0.000
Spl
Token Spl
Feature activation+0.000
inter
Tokeninter
Feature activation+0.000
sees
Token sees
Feature activation+1.112
the
Token the
Feature activation+0.000
can
Token can
Feature activation+0.000
ister
Tokenister
Feature activation+0.000
strike
Token strike
Feature activation+0.000
a
Token a
Feature activation+0.000
such
Token such
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
CEO
Token CEO
Feature activation+0.000
dece
Token dece
Feature activation+0.000
iving
Tokeniving
Feature activation+1.025
the
Token the
Feature activation+0.000
shareholders
Token shareholders
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
three
Token three
Feature activation+0.000
-
Token-
Feature activation+0.000
hour
Tokenhour
Feature activation+0.000
ab
Token ab
Feature activation+0.000
rid
Tokenrid
Feature activation+0.000
ged
Tokenged
Feature activation+0.919
version
Token version
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
text
Token text
Feature activation+0.000
over
Token over
Feature activation+0.000
separating
Token separating
Feature activation+0.000
the
Token the
Feature activation+0.000
chamber
Token chamber
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
viewing
Token viewing
Feature activation+0.654
room
Token room
Feature activation+0.000
were
Token were
Feature activation+0.000
lowered
Token lowered
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
supports
Token supports
Feature activation+0.000
,
Token,
Feature activation+0.000
l
Token l
Feature activation+0.000
adders
Tokenadders
Feature activation+0.000
and
Token and
Feature activation+0.000
steps
Token steps
Feature activation+0.499
,
Token,
Feature activation+0.000
perform
Token perform
Feature activation+0.000
a
Token a
Feature activation+0.000
mechanical
Token mechanical
Feature activation+0.000
ballet
Token ballet
Feature activation+0.000
from
Token from
Feature activation+0.000
members
Token members
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
PBS
Token PBS
Feature activation+0.000
show
Token show
Feature activation+0.493
"
Token "
Feature activation+0.000
This
TokenThis
Feature activation+0.000
Old
Token Old
Feature activation+0.000
House
Token House
Feature activation+0.000
",
Token",
Feature activation+0.000
opportunity
Token opportunity
Feature activation+0.000
but
Token but
Feature activation+0.000
I
Token I
Feature activation+0.000
am
Token am
Feature activation+0.000
not
Token not
Feature activation+0.000
blind
Token blind
Feature activation+0.434
to
Token to
Feature activation+0.000
other
Token other
Feature activation+0.000
types
Token types
Feature activation+0.000
of
Token of
Feature activation+0.000
alternative
Token alternative
Feature activation+0.000
but
Token but
Feature activation+0.000
there
Token there
Feature activation+0.000
's
Token's
Feature activation+0.000
no
Token no
Feature activation+0.000
mist
Token mist
Feature activation+0.000
aking
Tokenaking
Feature activation+0.418
Wisconsin
Token Wisconsin
Feature activation+0.000
as
Token as
Feature activation+0.000
a
Token a
Feature activation+0.000
contender
Token contender
Feature activation+0.000
to
Token to
Feature activation+0.000
act
Token act
Feature activation+0.000
of
Token of
Feature activation+0.000
sacrifice
Token sacrifice
Feature activation+0.000
/
Token/
Feature activation+0.000
off
Tokenoff
Feature activation+0.000
ering
Tokenering
Feature activation+0.399
/
Token/
Feature activation+0.000
w
Tokenw
Feature activation+0.000
orship
Tokenorship
Feature activation+0.000
ing
Tokening
Feature activation+0.000
.
Token.
Feature activation+0.000
way
Token way
Feature activation+0.000
the
Token the
Feature activation+0.000
underlying
Token underlying
Feature activation+0.000
ideological
Token ideological
Feature activation+0.000
blind
Token blind
Feature activation+0.000
ers
Tokeners
Feature activation+0.389
that
Token that
Feature activation+0.000
guide
Token guide
Feature activation+0.000
the
Token the
Feature activation+0.000
newspaper
Token newspaper
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
our
Token our
Feature activation+0.000
current
Token current
Feature activation+0.000
situation
Token situation
Feature activation+0.000
to
Token to
Feature activation+0.000
standing
Token standing
Feature activation+0.000
blind
Token blind
Feature activation+0.363
fold
Tokenfold
Feature activation+0.357
ed
Tokened
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
batting
Token batting
Feature activation+0.000
current
Token current
Feature activation+0.000
situation
Token situation
Feature activation+0.000
to
Token to
Feature activation+0.000
standing
Token standing
Feature activation+0.000
blind
Token blind
Feature activation+0.363
fold
Tokenfold
Feature activation+0.357
ed
Tokened
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
batting
Token batting
Feature activation+0.000
cage
Token cage
Feature activation+0.000
ative
Tokenative
Feature activation+0.000
,
Token,
Feature activation+0.000
no
Token no
Feature activation+0.000
-
Token-
Feature activation+0.000
hold
Tokenhold
Feature activation+0.000
s
Tokens
Feature activation+0.312
-
Token-
Feature activation+0.000
bar
Tokenbar
Feature activation+0.000
red
Tokenred
Feature activation+0.000
politics
Token politics
Feature activation+0.000
that
Token that
Feature activation+0.000
straight
Token straight
Feature activation+0.000
-
Token-
Feature activation+0.000
faced
Tokenfaced
Feature activation+0.000
reading
Token reading
Feature activation+0.000
of
Token of
Feature activation+0.000
cause
Token cause
Feature activation+0.286
of
Token of
Feature activation+0.000
death
Token death
Feature activation+0.173
:
Token:
Feature activation+0.000
demon
Token demon
Feature activation+0.000
is
Token is
Feature activation+0.000
Section
TokenSection
Feature activation+0.000
8
Token 8
Feature activation+0.000
:
Token:
Feature activation+0.000
Pre
Token Pre
Feature activation+0.000
jud
Tokenjud
Feature activation+0.000
ice
Tokenice
Feature activation+0.240
(
Token (
Feature activation+0.000
Studio
TokenStudio
Feature activation+0.000
Closed
Token Closed
Feature activation+0.000
)
Token)
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
blind
Tokenblind
Feature activation+0.000
ly
Tokenly
Feature activation+1.353
,
Token,
Feature activation+0.000
unconsciously
Token unconsciously
Feature activation+0.222
,
Token,
Feature activation+0.000
naturally
Token naturally
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
âĢĶ
Token âĢĶ
Feature activation+0.000
-
Token-
Feature activation+0.000
sounding
Tokensounding
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
smart
Tokensmart
Feature activation+0.000
ness
Tokenness
Feature activation+0.189
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
He
Token He
Feature activation+0.000
insists
Token insists
Feature activation+0.000

Top DFA by src position
MAX = 3.790

lip
Token lip
Feature activation+0.025
Top resid features:
service
Token service
Feature activation-0.000
Top resid features:
while
Token while
Feature activation-0.075
Top resid features:
turning
Token turning
Feature activation+0.014
Top resid features:
a
Token a
Feature activation-0.165
Top resid features:
blind
Token blind
Feature activation+3.790
Top resid features:
eye
Token eye
Feature activation+0.272
Top resid features:
--
Token --
Feature activation+0.000
Top resid features:
or
Token or
Feature activation+0.000
Top resid features:
maybe
Token maybe
Feature activation+0.000
Top resid features:
that
Token that
Feature activation+0.000
Top resid features:
ambiguity
Token ambiguity
Feature activation+0.110
Top resid features:
,
Token,
Feature activation-0.127
Top resid features:
nuance
Token nuance
Feature activation-0.009
Top resid features:
and
Token and
Feature activation-0.165
Top resid features:
turning
Token turning
Feature activation-0.075
Top resid features:
blind
Token blind
Feature activation+3.395
Top resid features:
eyes
Token eyes
Feature activation+0.244
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
So
Token So
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
control
Token control
Feature activation+0.111
Top resid features:
your
Token your
Feature activation-0.089
Top resid features:
computer
Token computer
Feature activation-0.163
Top resid features:
âĢ
Token âĢ
Feature activation-0.072
Top resid features:
ľ
Tokenľ
Feature activation+0.118
Top resid features:
blind
Tokenblind
Feature activation+3.072
Top resid features:
ly
Tokenly
Feature activation-0.080
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
unconsciously
Token unconsciously
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
naturally
Token naturally
Feature activation+0.000
Top resid features:
the
Token the
Feature activation-0.024
Top resid features:
traffic
Token traffic
Feature activation+0.010
Top resid features:
accident
Token accident
Feature activation+0.007
Top resid features:
between
Token between
Feature activation-0.001
Top resid features:
a
Token a
Feature activation-0.020
Top resid features:
blind
Token blind
Feature activation+2.323
Top resid features:
man
Token man
Feature activation-0.006
Top resid features:
and
Token and
Feature activation+0.002
Top resid features:
a
Token a
Feature activation-0.019
Top resid features:
truck
Token truck
Feature activation-0.004
Top resid features:
carrying
Token carrying
Feature activation+0.012
Top resid features:
,
Token,
Feature activation-0.091
Top resid features:
such
Token such
Feature activation+0.077
Top resid features:
as
Token as
Feature activation+0.034
Top resid features:
the
Token the
Feature activation-0.036
Top resid features:
CEO
Token CEO
Feature activation-0.055
Top resid features:
dece
Token dece
Feature activation+2.300
Top resid features:
iving
Tokeniving
Feature activation+0.255
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
shareholders
Token shareholders
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
or
Token or
Feature activation+0.000
Top resid features:
a
Token a
Feature activation-0.125
Top resid features:
three
Token three
Feature activation-0.036
Top resid features:
-
Token-
Feature activation-0.061
Top resid features:
hour
Tokenhour
Feature activation-0.040
Top resid features:
ab
Token ab
Feature activation-0.038
Top resid features:
rid
Tokenrid
Feature activation+2.677
Top resid features:
ged
Tokenged
Feature activation-0.036
Top resid features:
version
Token version
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
text
Token text
Feature activation+0.000
Top resid features:
being
Token being
Feature activation-0.002
Top resid features:
declared
Token declared
Feature activation+0.041
Top resid features:
dead
Token dead
Feature activation+0.008
Top resid features:
,
Token,
Feature activation-0.076
Top resid features:
the
Token the
Feature activation-0.018
Top resid features:
blind
Token blind
Feature activation+1.429
Top resid features:
s
Tokens
Feature activation+0.039
Top resid features:
separating
Token separating
Feature activation+0.041
Top resid features:
the
Token the
Feature activation-0.029
Top resid features:
chamber
Token chamber
Feature activation-0.018
Top resid features:
from
Token from
Feature activation+0.056
Top resid features:
each
Token each
Feature activation-0.003
Top resid features:
a
Token a
Feature activation-0.014
Top resid features:
400
Token 400
Feature activation+0.003
Top resid features:
-
Token-
Feature activation+0.028
Top resid features:
ton
Tokenton
Feature activation-0.019
Top resid features:
maze
Token maze
Feature activation+1.954
Top resid features:
of
Token of
Feature activation+0.042
Top resid features:
cables
Token cables
Feature activation-0.056
Top resid features:
and
Token and
Feature activation-0.037
Top resid features:
supports
Token supports
Feature activation+0.042
Top resid features:
,
Token,
Feature activation-0.173
Top resid features:
help
Token help
Feature activation+0.001
Top resid features:
from
Token from
Feature activation+0.060
Top resid features:
members
Token members
Feature activation-0.008
Top resid features:
of
Token of
Feature activation+0.064
Top resid features:
the
Token the
Feature activation-0.013
Top resid features:
PBS
Token PBS
Feature activation+2.330
Top resid features:
show
Token show
Feature activation-0.042
Top resid features:
"
Token "
Feature activation+0.000
Top resid features:
This
TokenThis
Feature activation+0.000
Top resid features:
Old
Token Old
Feature activation+0.000
Top resid features:
House
Token House
Feature activation+0.000
Top resid features:
opportunity
Token opportunity
Feature activation+0.002
Top resid features:
but
Token but
Feature activation-0.086
Top resid features:
I
Token I
Feature activation-0.047
Top resid features:
am
Token am
Feature activation-0.054
Top resid features:
not
Token not
Feature activation-0.004
Top resid features:
blind
Token blind
Feature activation+1.703
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
other
Token other
Feature activation+0.000
Top resid features:
types
Token types
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
alternative
Token alternative
Feature activation+0.000
Top resid features:
,
Token,
Feature activation-0.046
Top resid features:
but
Token but
Feature activation-0.014
Top resid features:
there
Token there
Feature activation+0.015
Top resid features:
's
Token's
Feature activation+0.002
Top resid features:
no
Token no
Feature activation+0.176
Top resid features:
mist
Token mist
Feature activation+1.732
Top resid features:
aking
Tokenaking
Feature activation-0.040
Top resid features:
Wisconsin
Token Wisconsin
Feature activation+0.000
Top resid features:
as
Token as
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
contender
Token contender
Feature activation+0.000
Top resid features:
the
Token the
Feature activation-0.034
Top resid features:
act
Token act
Feature activation-0.027
Top resid features:
of
Token of
Feature activation+0.032
Top resid features:
sacrifice
Token sacrifice
Feature activation+0.062
Top resid features:
/
Token/
Feature activation-0.282
Top resid features:
off
Tokenoff
Feature activation+1.763
Top resid features:
ering
Tokenering
Feature activation+0.022
Top resid features:
/
Token/
Feature activation+0.000
Top resid features:
w
Tokenw
Feature activation+0.000
Top resid features:
orship
Tokenorship
Feature activation+0.000
Top resid features:
ing
Tokening
Feature activation+0.000
Top resid features:
imentary
Tokenimentary
Feature activation-0.008
Top resid features:
way
Token way
Feature activation+0.003
Top resid features:
the
Token the
Feature activation-0.057
Top resid features:
underlying
Token underlying
Feature activation-0.057
Top resid features:
ideological
Token ideological
Feature activation+0.004
Top resid features:
blind
Token blind
Feature activation+2.160
Top resid features:
ers
Tokeners
Feature activation+0.062
Top resid features:
that
Token that
Feature activation+0.000
Top resid features:
guide
Token guide
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
newspaper
Token newspaper
Feature activation+0.000
Top resid features:
our
Token our
Feature activation-0.033
Top resid features:
current
Token current
Feature activation+0.029
Top resid features:
situation
Token situation
Feature activation+0.042
Top resid features:
to
Token to
Feature activation+0.062
Top resid features:
standing
Token standing
Feature activation-0.026
Top resid features:
blind
Token blind
Feature activation+1.531
Top resid features:
fold
Tokenfold
Feature activation+0.000
Top resid features:
ed
Tokened
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
batting
Token batting
Feature activation+0.000
Top resid features:
our
Token our
Feature activation-0.039
Top resid features:
current
Token current
Feature activation+0.013
Top resid features:
situation
Token situation
Feature activation+0.025
Top resid features:
to
Token to
Feature activation+0.097
Top resid features:
standing
Token standing
Feature activation+0.007
Top resid features:
blind
Token blind
Feature activation+1.445
Top resid features:
fold
Tokenfold
Feature activation-0.045
Top resid features:
ed
Tokened
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
batting
Token batting
Feature activation+0.000
Top resid features:
comb
Token comb
Feature activation+0.038
Top resid features:
ative
Tokenative
Feature activation+0.187
Top resid features:
,
Token,
Feature activation-0.086
Top resid features:
no
Token no
Feature activation+0.097
Top resid features:
-
Token-
Feature activation+0.063
Top resid features:
hold
Tokenhold
Feature activation+1.433
Top resid features:
s
Tokens
Feature activation+0.052
Top resid features:
-
Token-
Feature activation+0.000
Top resid features:
bar
Tokenbar
Feature activation+0.000
Top resid features:
red
Tokenred
Feature activation+0.000
Top resid features:
politics
Token politics
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.709
Top resid features:
.
Token.
Feature activation-0.011
Top resid features:
The
Token The
Feature activation-0.134
Top resid features:
coroner
Token coroner
Feature activation+1.182
Top resid features:
âĢ
TokenâĢ
Feature activation+0.027
Top resid features:
Ļ
TokenĻ
Feature activation+0.036
Top resid features:
s
Tokens
Feature activation+0.023
Top resid features:
straight
Token straight
Feature activation+0.078
Top resid features:
-
Token-
Feature activation+0.043
Top resid features:
Ċ
TokenĊ
Feature activation-0.009
Top resid features:
Section
TokenSection
Feature activation-0.151
Top resid features:
8
Token 8
Feature activation+0.024
Top resid features:
:
Token:
Feature activation-0.179
Top resid features:
Pre
Token Pre
Feature activation-0.229
Top resid features:
jud
Tokenjud
Feature activation+1.899
Top resid features:
ice
Tokenice
Feature activation-0.044
Top resid features:
(
Token (
Feature activation+0.000
Top resid features:
Studio
TokenStudio
Feature activation+0.000
Top resid features:
Closed
Token Closed
Feature activation+0.000
Top resid features:
)
Token)
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.018
Top resid features:
control
Token control
Feature activation+0.118
Top resid features:
your
Token your
Feature activation-0.083
Top resid features:
computer
Token computer
Feature activation-0.096
Top resid features:
âĢ
Token âĢ
Feature activation+0.056
Top resid features:
ľ
Tokenľ
Feature activation+0.049
Top resid features:
ored
Tokenored
Feature activation+0.051
Top resid features:
-
Token-
Feature activation+0.031
Top resid features:
sounding
Tokensounding
Feature activation-0.047
Top resid features:
âĢ
Token âĢ
Feature activation-0.007
Top resid features:
ľ
Tokenľ
Feature activation+0.038
Top resid features:
smart
Tokensmart
Feature activation+1.130
Top resid features:
ness
Tokenness
Feature activation+0.178
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ŀ
TokenĿ
Feature activation+0.000
Top resid features:
He
Token He
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.06

Head 1: 0.05

Head 2: 0.04

Head 3: 0.02

Head 4: 0.04

Head 5: 0.05

Head 6: 0.54

Head 7: 0.05

Head 8: 0.03

Head 9: 0.03

Head 10: 0.03

Head 11: 0.06

Positive logits

ensical2.08

Paraly2.06

dart2.00

monkey1.90

monkeys1.88

blinded1.86

bunny1.86

clown1.81

juggling1.76

naïve1.72

Stamford1.70

dodged1.68

Blazers1.65

broom1.63

vaccine1.63

paralyzed1.61

impartial1.59

ears1.58

owl1.58

sled1.58

Negative logits

ZI-1.87

minster-1.87

amura-1.81

auri-1.74

arez-1.74

jen-1.73

Reloaded-1.72

nants-1.72

dat-1.70

ieri-1.68

custom-1.68

tera-1.67

JO-1.63

Ger-1.62

itant-1.61

Quote-1.61

installs-1.61

xit-1.59

aeda-1.58

nikov-1.57

INTERVAL 2.110 - 2.344
CONTAINS 0.000%

service
Token service
Feature activation+0.000
while
Token while
Feature activation+0.000
turning
Token turning
Feature activation+0.000
a
Token a
Feature activation+0.000
blind
Token blind
Feature activation+0.000
eye
Token eye
Feature activation+2.344
--
Token --
Feature activation+0.000
or
Token or
Feature activation+0.000
maybe
Token maybe
Feature activation+0.000
that
Token that
Feature activation+0.000
's
Token's
Feature activation+0.000

INTERVAL 1.875 - 2.110
CONTAINS 0.000%

,
Token,
Feature activation+0.000
nuance
Token nuance
Feature activation+0.000
and
Token and
Feature activation+0.000
turning
Token turning
Feature activation+0.000
blind
Token blind
Feature activation+0.000
eyes
Token eyes
Feature activation+2.109
.
Token.
Feature activation+0.000
So
Token So
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000

INTERVAL 1.641 - 1.875
CONTAINS 0.000%

INTERVAL 1.406 - 1.641
CONTAINS 0.000%

INTERVAL 1.172 - 1.406
CONTAINS 0.000%

your
Token your
Feature activation+0.000
computer
Token computer
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
blind
Tokenblind
Feature activation+0.000
ly
Tokenly
Feature activation+1.353
,
Token,
Feature activation+0.000
unconsciously
Token unconsciously
Feature activation+0.222
,
Token,
Feature activation+0.000
naturally
Token naturally
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000

INTERVAL 0.938 - 1.172
CONTAINS 0.000%

such
Token such
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
CEO
Token CEO
Feature activation+0.000
dece
Token dece
Feature activation+0.000
iving
Tokeniving
Feature activation+1.025
the
Token the
Feature activation+0.000
shareholders
Token shareholders
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
first
Token first
Feature activation+0.000
issue
Token issue
Feature activation+0.000
,
Token,
Feature activation+0.000
Spl
Token Spl
Feature activation+0.000
inter
Tokeninter
Feature activation+0.000
sees
Token sees
Feature activation+1.112
the
Token the
Feature activation+0.000
can
Token can
Feature activation+0.000
ister
Tokenister
Feature activation+0.000
strike
Token strike
Feature activation+0.000
a
Token a
Feature activation+0.000

INTERVAL 0.703 - 0.938
CONTAINS 0.000%

three
Token three
Feature activation+0.000
-
Token-
Feature activation+0.000
hour
Tokenhour
Feature activation+0.000
ab
Token ab
Feature activation+0.000
rid
Tokenrid
Feature activation+0.000
ged
Tokenged
Feature activation+0.919
version
Token version
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
text
Token text
Feature activation+0.000
over
Token over
Feature activation+0.000

INTERVAL 0.469 - 0.703
CONTAINS 0.000%

separating
Token separating
Feature activation+0.000
the
Token the
Feature activation+0.000
chamber
Token chamber
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
viewing
Token viewing
Feature activation+0.654
room
Token room
Feature activation+0.000
were
Token were
Feature activation+0.000
lowered
Token lowered
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
supports
Token supports
Feature activation+0.000
,
Token,
Feature activation+0.000
l
Token l
Feature activation+0.000
adders
Tokenadders
Feature activation+0.000
and
Token and
Feature activation+0.000
steps
Token steps
Feature activation+0.499
,
Token,
Feature activation+0.000
perform
Token perform
Feature activation+0.000
a
Token a
Feature activation+0.000
mechanical
Token mechanical
Feature activation+0.000
ballet
Token ballet
Feature activation+0.000
from
Token from
Feature activation+0.000
members
Token members
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
PBS
Token PBS
Feature activation+0.000
show
Token show
Feature activation+0.493
"
Token "
Feature activation+0.000
This
TokenThis
Feature activation+0.000
Old
Token Old
Feature activation+0.000
House
Token House
Feature activation+0.000
",
Token",
Feature activation+0.000

INTERVAL 0.234 - 0.469
CONTAINS 0.001%

ative
Tokenative
Feature activation+0.000
,
Token,
Feature activation+0.000
no
Token no
Feature activation+0.000
-
Token-
Feature activation+0.000
hold
Tokenhold
Feature activation+0.000
s
Tokens
Feature activation+0.312
-
Token-
Feature activation+0.000
bar
Tokenbar
Feature activation+0.000
red
Tokenred
Feature activation+0.000
politics
Token politics
Feature activation+0.000
that
Token that
Feature activation+0.000
but
Token but
Feature activation+0.000
there
Token there
Feature activation+0.000
's
Token's
Feature activation+0.000
no
Token no
Feature activation+0.000
mist
Token mist
Feature activation+0.000
aking
Tokenaking
Feature activation+0.418
Wisconsin
Token Wisconsin
Feature activation+0.000
as
Token as
Feature activation+0.000
a
Token a
Feature activation+0.000
contender
Token contender
Feature activation+0.000
to
Token to
Feature activation+0.000
current
Token current
Feature activation+0.000
situation
Token situation
Feature activation+0.000
to
Token to
Feature activation+0.000
standing
Token standing
Feature activation+0.000
blind
Token blind
Feature activation+0.363
fold
Tokenfold
Feature activation+0.357
ed
Tokened
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
batting
Token batting
Feature activation+0.000
cage
Token cage
Feature activation+0.000
opportunity
Token opportunity
Feature activation+0.000
but
Token but
Feature activation+0.000
I
Token I
Feature activation+0.000
am
Token am
Feature activation+0.000
not
Token not
Feature activation+0.000
blind
Token blind
Feature activation+0.434
to
Token to
Feature activation+0.000
other
Token other
Feature activation+0.000
types
Token types
Feature activation+0.000
of
Token of
Feature activation+0.000
alternative
Token alternative
Feature activation+0.000
straight
Token straight
Feature activation+0.000
-
Token-
Feature activation+0.000
faced
Tokenfaced
Feature activation+0.000
reading
Token reading
Feature activation+0.000
of
Token of
Feature activation+0.000
cause
Token cause
Feature activation+0.286
of
Token of
Feature activation+0.000
death
Token death
Feature activation+0.173
:
Token:
Feature activation+0.000
demon
Token demon
Feature activation+0.000
is
Token is
Feature activation+0.000

INTERVAL 0.000 - 0.234
CONTAINS 99.998%

picnic
Token picnic
Feature activation+0.000
.
Token.
Feature activation+0.000
A
Token A
Feature activation+0.000
large
Token large
Feature activation+0.000
and
Token and
Feature activation+0.000
angry
Token angry
Feature activation+0.000
thr
Token thr
Feature activation+0.000
ong
Tokenong
Feature activation+0.000
of
Token of
Feature activation+0.000
Ford
Token Ford
Feature activation+0.000
supporters
Token supporters
Feature activation+0.000
problem
Token problem
Feature activation+0.000
with
Token with
Feature activation+0.000
intermittent
Token intermittent
Feature activation+0.000
skipping
Token skipping
Feature activation+0.000
,
Token,
Feature activation+0.000
so
Token so
Feature activation+0.000
I
Token I
Feature activation+0.000
stopped
Token stopped
Feature activation+0.000
using
Token using
Feature activation+0.000
it
Token it
Feature activation+0.000
not
Token not
Feature activation+0.000
total
Token total
Feature activation+0.000
of
Token of
Feature activation+0.000
around
Token around
Feature activation+0.000
15
Token 15
Feature activation+0.000
trillion
Token trillion
Feature activation+0.000
yen
Token yen
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Under
TokenUnder
Feature activation+0.000
Kuro
Token Kuro
Feature activation+0.000
30
Token 30
Feature activation+0.000
years
Token years
Feature activation+0.000
in
Token in
Feature activation+0.000
prison
Token prison
Feature activation+0.000
and
Token and
Feature activation+0.000
institutional
Token institutional
Feature activation+0.000
ized
Tokenized
Feature activation+0.000
psychiatric
Token psychiatric
Feature activation+0.000
care
Token care
Feature activation+0.000
in
Token in
Feature activation+0.000
November
Token November
Feature activation+0.000
told
Token told
Feature activation+0.000
my
Token my
Feature activation+0.000
parents
Token parents
Feature activation+0.000
that
Token that
Feature activation+0.000
I
Token I
Feature activation+0.000
never
Token never
Feature activation+0.000
wanted
Token wanted
Feature activation+0.000
kids
Token kids
Feature activation+0.000
,
Token,
Feature activation+0.000
just
Token just
Feature activation+0.000
dogs
Token dogs
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 3 in H1.6: (feature 8927

TOP ACTIVATIONS
MAX = 2.536

,
Token,
Feature activation+0.000
most
Token most
Feature activation+0.000
such
Token such
Feature activation+0.000
marriages
Token marriages
Feature activation+0.081
are
Token are
Feature activation+0.025
arranged
Token arranged
Feature activation+2.536
through
Token through
Feature activation+0.114
illegal
Token illegal
Feature activation+0.272
channels
Token channels
Feature activation+0.543
,
Token,
Feature activation+0.000
according
Token according
Feature activation+0.000
,
Token,
Feature activation+0.000
as
Token as
Feature activation+0.000
well
Token well
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
couple
Token couple
Feature activation+2.474
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
30
Token 30
Feature activation+0.000
-
Token-
Feature activation+0.000
ourt
Tokenourt
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
together
Token together
Feature activation+0.537
the
Token the
Feature activation+0.000
couple
Token couple
Feature activation+2.473
used
Token used
Feature activation+0.000
their
Token their
Feature activation+0.320
vast
Token vast
Feature activation+0.000
holdings
Token holdings
Feature activation+0.000
in
Token in
Feature activation+0.000
and
Token and
Feature activation+0.730
a
Token a
Feature activation+0.195
woman
Token woman
Feature activation+1.348
and
Token and
Feature activation+0.508
this
Token this
Feature activation+0.000
union
Token union
Feature activation+2.452
must
Token must
Feature activation+0.000
be
Token be
Feature activation+0.031
preserved
Token preserved
Feature activation+0.224
,"
Token,"
Feature activation+0.000
conc
Token conc
Feature activation+1.436
to
Token to
Feature activation+0.000
350
Token 350
Feature activation+0.064
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
couples
Token couples
Feature activation+2.220
splitting
Token splitting
Feature activation+2.294
in
Token in
Feature activation+0.365
a
Token a
Feature activation+0.522
year
Token year
Feature activation+0.250
since
Token since
Feature activation+0.000
2010
Token 2010
Feature activation+0.048
"
Token"
Feature activation+0.000
I
TokenI
Feature activation+0.000
believe
Token believe
Feature activation+0.000
marriage
Token marriage
Feature activation+0.000
is
Token is
Feature activation+0.000
between
Token between
Feature activation+2.281
a
Token a
Feature activation+0.074
man
Token man
Feature activation+1.238
and
Token and
Feature activation+0.730
a
Token a
Feature activation+0.195
woman
Token woman
Feature activation+1.348
leads
Token leads
Feature activation+0.000
to
Token to
Feature activation+0.000
350
Token 350
Feature activation+0.064
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
couples
Token couples
Feature activation+2.220
splitting
Token splitting
Feature activation+2.294
in
Token in
Feature activation+0.365
a
Token a
Feature activation+0.522
year
Token year
Feature activation+0.250
since
Token since
Feature activation+0.000
Steve
Token Steve
Feature activation+0.000
Basil
Token Basil
Feature activation+0.000
one
Tokenone
Feature activation+0.000
,
Token,
Feature activation+0.000
recently
Token recently
Feature activation+0.000
filed
Token filed
Feature activation+2.095
a
Token a
Feature activation+0.113
joint
Token joint
Feature activation+1.713
petition
Token petition
Feature activation+1.449
for
Token for
Feature activation+0.000
divorce
Token divorce
Feature activation+1.166
after
Token after
Feature activation+0.000
a
Token a
Feature activation+0.000
man
Token man
Feature activation+0.000
whose
Token whose
Feature activation+0.000
marriage
Token marriage
Feature activation+0.000
proposal
Token proposal
Feature activation+1.992
she
Token she
Feature activation+0.000
allegedly
Token allegedly
Feature activation+0.000
sp
Token sp
Feature activation+0.000
urned
Tokenurned
Feature activation+0.000
attacked
Token attacked
Feature activation+0.000
April
Token April
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
couple
Token couple
Feature activation+1.930
sealed
Token sealed
Feature activation+0.051
their
Token their
Feature activation+0.729
marriage
Token marriage
Feature activation+0.589
with
Token with
Feature activation+1.236
corresponding
Token corresponding
Feature activation+0.298
marry
Token marry
Feature activation+0.253
after
Token after
Feature activation+0.427
sometime
Token sometime
Feature activation+0.000
.
Token.
Feature activation+0.000
In
Token In
Feature activation+0.000
between
Token between
Feature activation+1.835
,
Token,
Feature activation+0.000
I
Token I
Feature activation+0.249
met
Token met
Feature activation+0.945
a
Token a
Feature activation+0.121
girl
Token girl
Feature activation+0.683
resulting
Token resulting
Feature activation+0.000
in
Token in
Feature activation+0.000
thousands
Token thousands
Feature activation+0.000
of
Token of
Feature activation+0.000
marriages
Token marriages
Feature activation+0.415
between
Token between
Feature activation+1.831
Vietnamese
Token Vietnamese
Feature activation+0.000
women
Token women
Feature activation+0.727
and
Token and
Feature activation+0.601
foreign
Token foreign
Feature activation+0.000
men
Token men
Feature activation+0.184
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
While
TokenWhile
Feature activation+0.000
there
Token there
Feature activation+0.000
are
Token are
Feature activation+0.000
some
Token some
Feature activation+0.000
couples
Token couples
Feature activation+1.830
who
Token who
Feature activation+0.384
can
Token can
Feature activation+0.000
pull
Token pull
Feature activation+0.321
off
Token off
Feature activation+0.000
weddings
Token weddings
Feature activation+1.569
only
Token only
Feature activation+0.000
route
Token route
Feature activation+0.000
to
Token to
Feature activation+0.000
having
Token having
Feature activation+1.210
a
Token a
Feature activation+0.108
baby
Token baby
Feature activation+1.821
is
Token is
Feature activation+0.000
international
Token international
Feature activation+0.000
surrog
Token surrog
Feature activation+1.342
acy
Tokenacy
Feature activation+0.000
.
Token.
Feature activation+0.000
in
Token in
Feature activation+0.000
Hong
Token Hong
Feature activation+0.000
Kong
Token Kong
Feature activation+0.000
are
Token are
Feature activation+0.000
valid
Token valid
Feature activation+0.000
relationships
Token relationships
Feature activation+1.818
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
har
Tokenhar
Feature activation+0.000
tha
Tokentha
Feature activation+0.000
,
Token,
Feature activation+0.000
whose
Token whose
Feature activation+0.000
marriage
Token marriage
Feature activation+0.000
proposal
Token proposal
Feature activation+1.798
she
Token she
Feature activation+0.000
turned
Token turned
Feature activation+0.000
down
Token down
Feature activation+0.000
weeks
Token weeks
Feature activation+0.000
ago
Token ago
Feature activation+0.000
a
Token a
Feature activation+0.000
recognition
Token recognition
Feature activation+0.000
that
Token that
Feature activation+0.000
same
Token same
Feature activation+0.181
sex
Token sex
Feature activation+0.317
relationships
Token relationships
Feature activation+1.788
in
Token in
Feature activation+0.000
Hong
Token Hong
Feature activation+0.000
Kong
Token Kong
Feature activation+0.000
are
Token are
Feature activation+0.000
valid
Token valid
Feature activation+0.000
continue
Token continue
Feature activation+0.000
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.393
live
Token live
Feature activation+0.504
in
Token in
Feature activation+0.000
relationship
Token relationship
Feature activation+1.765
with
Token with
Feature activation+0.637
first
Token first
Feature activation+0.000
girl
Token girl
Feature activation+0.248
and
Token and
Feature activation+0.000
we
Token we
Feature activation+0.315
one
Tokenone
Feature activation+0.000
,
Token,
Feature activation+0.000
recently
Token recently
Feature activation+0.000
filed
Token filed
Feature activation+2.095
a
Token a
Feature activation+0.113
joint
Token joint
Feature activation+1.713
petition
Token petition
Feature activation+1.449
for
Token for
Feature activation+0.000
divorce
Token divorce
Feature activation+1.166
according
Token according
Feature activation+0.211
to
Token to
Feature activation+0.000
/
Token/
Feature activation+0.000
pse
Tokenpse
Feature activation+0.000
udo
Tokenudo
Feature activation+0.000
-
Token-
Feature activation+0.000
se
Tokense
Feature activation+0.000
xy
Tokenxy
Feature activation+1.672
times
Token times
Feature activation+0.000
that
Token that
Feature activation+0.000
occur
Token occur
Feature activation+0.000
on
Token on
Feature activation+0.000
the
Token the
Feature activation+0.000

Top DFA by src position
MAX = 3.293

,
Token,
Feature activation+0.008
Top resid features:
resulting
Token resulting
Feature activation+0.017
Top resid features:
in
Token in
Feature activation+0.013
Top resid features:
thousands
Token thousands
Feature activation-0.005
Top resid features:
of
Token of
Feature activation+0.006
Top resid features:
marriages
Token marriages
Feature activation+1.447
Top resid features:
between
Token between
Feature activation+0.045
Top resid features:
Vietnamese
Token Vietnamese
Feature activation+0.014
Top resid features:
women
Token women
Feature activation+0.045
Top resid features:
and
Token and
Feature activation+0.013
Top resid features:
foreign
Token foreign
Feature activation-0.002
Top resid features:
s
Tokens
Feature activation-0.016
Top resid features:
death
Token death
Feature activation+0.006
Top resid features:
and
Token and
Feature activation-0.039
Top resid features:
his
Token his
Feature activation-0.023
Top resid features:
second
Token second
Feature activation-0.001
Top resid features:
marriage
Token marriage
Feature activation+1.592
Top resid features:
,
Token,
Feature activation-0.099
Top resid features:
as
Token as
Feature activation-0.060
Top resid features:
well
Token well
Feature activation-0.060
Top resid features:
as
Token as
Feature activation-0.059
Top resid features:
the
Token the
Feature activation+0.053
Top resid features:
and
Token and
Feature activation-0.013
Top resid features:
lawyer
Token lawyer
Feature activation-0.028
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
she
Token she
Feature activation-0.041
Top resid features:
was
Token was
Feature activation-0.033
Top resid features:
married
Token married
Feature activation+3.145
Top resid features:
to
Token to
Feature activation-0.024
Top resid features:
Frank
Token Frank
Feature activation-0.002
Top resid features:
McC
Token McC
Feature activation-0.013
Top resid features:
ourt
Tokenourt
Feature activation-0.018
Top resid features:
,
Token,
Feature activation-0.053
Top resid features:
Ċ
TokenĊ
Feature activation+0.008
Top resid features:
Ċ
TokenĊ
Feature activation-0.002
Top resid features:
"
Token"
Feature activation-0.013
Top resid features:
I
TokenI
Feature activation+0.023
Top resid features:
believe
Token believe
Feature activation+0.013
Top resid features:
marriage
Token marriage
Feature activation+3.146
Top resid features:
is
Token is
Feature activation+0.022
Top resid features:
between
Token between
Feature activation+0.132
Top resid features:
a
Token a
Feature activation+0.016
Top resid features:
man
Token man
Feature activation+0.022
Top resid features:
and
Token and
Feature activation-0.055
Top resid features:
leads
Token leads
Feature activation-0.044
Top resid features:
to
Token to
Feature activation-0.021
Top resid features:
350
Token 350
Feature activation-0.044
Top resid features:
,
Token,
Feature activation-0.041
Top resid features:
000
Token000
Feature activation-0.042
Top resid features:
couples
Token couples
Feature activation+1.623
Top resid features:
splitting
Token splitting
Feature activation+0.055
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
year
Token year
Feature activation+0.000
Top resid features:
since
Token since
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation-0.013
Top resid features:
Ċ
TokenĊ
Feature activation-0.038
Top resid features:
"
Token"
Feature activation-0.059
Top resid features:
I
TokenI
Feature activation-0.014
Top resid features:
believe
Token believe
Feature activation-0.028
Top resid features:
marriage
Token marriage
Feature activation+3.293
Top resid features:
is
Token is
Feature activation-0.082
Top resid features:
between
Token between
Feature activation-0.045
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
man
Token man
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
of
Token of
Feature activation-0.011
Top resid features:
no
Token no
Feature activation-0.005
Top resid features:
-
Token-
Feature activation-0.001
Top resid features:
f
Tokenf
Feature activation+0.012
Top resid features:
ault
Tokenault
Feature activation-0.032
Top resid features:
divorce
Token divorce
Feature activation+2.682
Top resid features:
leads
Token leads
Feature activation-0.025
Top resid features:
to
Token to
Feature activation-0.026
Top resid features:
350
Token 350
Feature activation-0.046
Top resid features:
,
Token,
Feature activation-0.050
Top resid features:
000
Token000
Feature activation-0.018
Top resid features:
New
Token New
Feature activation-0.010
Top resid features:
Black
Token Black
Feature activation-0.007
Top resid features:
has
Token has
Feature activation+0.027
Top resid features:
filed
Token filed
Feature activation+0.030
Top resid features:
for
Token for
Feature activation+0.001
Top resid features:
divorce
Token divorce
Feature activation+2.881
Top resid features:
from
Token from
Feature activation+0.012
Top resid features:
her
Token her
Feature activation+0.022
Top resid features:
husband
Token husband
Feature activation+0.093
Top resid features:
after
Token after
Feature activation+0.003
Top resid features:
realizing
Token realizing
Feature activation+0.010
Top resid features:
body
Token body
Feature activation+0.007
Top resid features:
after
Token after
Feature activation-0.054
Top resid features:
a
Token a
Feature activation-0.014
Top resid features:
man
Token man
Feature activation+0.001
Top resid features:
whose
Token whose
Feature activation-0.120
Top resid features:
marriage
Token marriage
Feature activation+3.121
Top resid features:
proposal
Token proposal
Feature activation+0.084
Top resid features:
she
Token she
Feature activation+0.000
Top resid features:
allegedly
Token allegedly
Feature activation+0.000
Top resid features:
sp
Token sp
Feature activation+0.000
Top resid features:
urned
Tokenurned
Feature activation+0.000
Top resid features:
relationship
Token relationship
Feature activation+0.241
Top resid features:
with
Token with
Feature activation-0.009
Top resid features:
her
Token her
Feature activation-0.021
Top resid features:
now
Token now
Feature activation-0.005
Top resid features:
-
Token-
Feature activation-0.006
Top resid features:
husband
Tokenhusband
Feature activation+0.890
Top resid features:
,
Token,
Feature activation-0.025
Top resid features:
Brad
Token Brad
Feature activation-0.014
Top resid features:
Ox
Token Ox
Feature activation-0.012
Top resid features:
ley
Tokenley
Feature activation-0.006
Top resid features:
,
Token,
Feature activation-0.037
Top resid features:
and
Token and
Feature activation-0.025
Top resid features:
we
Token we
Feature activation-0.004
Top resid features:
had
Token had
Feature activation-0.018
Top resid features:
intention
Token intention
Feature activation-0.011
Top resid features:
to
Token to
Feature activation-0.029
Top resid features:
marry
Token marry
Feature activation+1.084
Top resid features:
after
Token after
Feature activation-0.026
Top resid features:
sometime
Token sometime
Feature activation-0.014
Top resid features:
.
Token.
Feature activation-0.145
Top resid features:
In
Token In
Feature activation+0.121
Top resid features:
between
Token between
Feature activation-0.153
Top resid features:
,
Token,
Feature activation-0.028
Top resid features:
resulting
Token resulting
Feature activation+0.040
Top resid features:
in
Token in
Feature activation+0.016
Top resid features:
thousands
Token thousands
Feature activation-0.007
Top resid features:
of
Token of
Feature activation-0.056
Top resid features:
marriages
Token marriages
Feature activation+2.560
Top resid features:
between
Token between
Feature activation-0.112
Top resid features:
Vietnamese
Token Vietnamese
Feature activation+0.000
Top resid features:
women
Token women
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
foreign
Token foreign
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.142
Top resid features:
While
TokenWhile
Feature activation+0.005
Top resid features:
there
Token there
Feature activation+0.005
Top resid features:
are
Token are
Feature activation-0.092
Top resid features:
some
Token some
Feature activation+0.066
Top resid features:
couples
Token couples
Feature activation+2.647
Top resid features:
who
Token who
Feature activation+0.000
Top resid features:
can
Token can
Feature activation+0.000
Top resid features:
pull
Token pull
Feature activation+0.000
Top resid features:
off
Token off
Feature activation+0.000
Top resid features:
weddings
Token weddings
Feature activation+0.000
Top resid features:
igree
Tokenigree
Feature activation+0.016
Top resid features:
.
Token.
Feature activation-0.019
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.045
Top resid features:
For
TokenFor
Feature activation-0.005
Top resid features:
many
Token many
Feature activation-0.007
Top resid features:
couples
Token couples
Feature activation+3.156
Top resid features:
in
Token in
Feature activation-0.017
Top resid features:
the
Token the
Feature activation-0.003
Top resid features:
developed
Token developed
Feature activation-0.044
Top resid features:
world
Token world
Feature activation-0.038
Top resid features:
,
Token,
Feature activation-0.065
Top resid features:
towards
Token towards
Feature activation-0.024
Top resid features:
equality
Token equality
Feature activation-0.007
Top resid features:
for
Token for
Feature activation-0.014
Top resid features:
same
Token same
Feature activation+0.014
Top resid features:
sex
Token sex
Feature activation+0.202
Top resid features:
couples
Token couples
Feature activation+2.706
Top resid features:
in
Token in
Feature activation-0.010
Top resid features:
Hong
Token Hong
Feature activation+0.004
Top resid features:
Kong
Token Kong
Feature activation-0.027
Top resid features:
.
Token.
Feature activation-0.038
Top resid features:
It
Token It
Feature activation-0.003
Top resid features:
Sidd
Token Sidd
Feature activation-0.055
Top resid features:
har
Tokenhar
Feature activation-0.030
Top resid features:
tha
Tokentha
Feature activation-0.036
Top resid features:
,
Token,
Feature activation-0.122
Top resid features:
whose
Token whose
Feature activation-0.066
Top resid features:
marriage
Token marriage
Feature activation+2.907
Top resid features:
proposal
Token proposal
Feature activation+0.093
Top resid features:
she
Token she
Feature activation+0.000
Top resid features:
turned
Token turned
Feature activation+0.000
Top resid features:
down
Token down
Feature activation+0.000
Top resid features:
weeks
Token weeks
Feature activation+0.000
Top resid features:
towards
Token towards
Feature activation-0.033
Top resid features:
equality
Token equality
Feature activation-0.016
Top resid features:
for
Token for
Feature activation-0.023
Top resid features:
same
Token same
Feature activation+0.016
Top resid features:
sex
Token sex
Feature activation+0.210
Top resid features:
couples
Token couples
Feature activation+2.745
Top resid features:
in
Token in
Feature activation-0.016
Top resid features:
Hong
Token Hong
Feature activation+0.010
Top resid features:
Kong
Token Kong
Feature activation-0.037
Top resid features:
.
Token.
Feature activation-0.076
Top resid features:
It
Token It
Feature activation-0.004
Top resid features:
without
Token without
Feature activation+0.006
Top resid features:
me
Token me
Feature activation+0.011
Top resid features:
and
Token and
Feature activation+0.004
Top resid features:
demanded
Token demanded
Feature activation-0.004
Top resid features:
to
Token to
Feature activation+0.006
Top resid features:
marry
Token marry
Feature activation+1.709
Top resid features:
her
Token her
Feature activation+0.047
Top resid features:
.
Token.
Feature activation+0.006
Top resid features:
I
Token I
Feature activation+0.018
Top resid features:
told
Token told
Feature activation+0.001
Top resid features:
this
Token this
Feature activation+0.013
Top resid features:
New
Token New
Feature activation-0.007
Top resid features:
Black
Token Black
Feature activation-0.008
Top resid features:
has
Token has
Feature activation+0.023
Top resid features:
filed
Token filed
Feature activation+0.091
Top resid features:
for
Token for
Feature activation+0.004
Top resid features:
divorce
Token divorce
Feature activation+2.162
Top resid features:
from
Token from
Feature activation+0.010
Top resid features:
her
Token her
Feature activation+0.026
Top resid features:
husband
Token husband
Feature activation+0.141
Top resid features:
after
Token after
Feature activation+0.009
Top resid features:
realizing
Token realizing
Feature activation+0.005
Top resid features:
Ċ
TokenĊ
Feature activation-0.007
Top resid features:
Turn
TokenTurn
Feature activation+0.002
Top resid features:
s
Tokens
Feature activation+0.007
Top resid features:
out
Token out
Feature activation-0.005
Top resid features:
that
Token that
Feature activation-0.005
Top resid features:
couples
Token couples
Feature activation+2.021
Top resid features:
(
Token (
Feature activation-0.005
Top resid features:
well
Tokenwell
Feature activation-0.002
Top resid features:
,
Token,
Feature activation-0.008
Top resid features:
the
Token the
Feature activation+0.005
Top resid features:
worst
Token worst
Feature activation-0.008
Top resid features:

Decoder Weights Distribution

Head 0: 0.04

Head 1: 0.04

Head 2: 0.02

Head 3: 0.02

Head 4: 0.04

Head 5: 0.09

Head 6: 0.54

Head 7: 0.06

Head 8: 0.03

Head 9: 0.03

Head 10: 0.02

Head 11: 0.07

Positive logits

twins2.41

arenthood2.39

marriage2.36

divor2.24

Marriage2.19

equality2.17

marriage2.11

couples2.10

spouses2.08

spouse2.03

compan1.99

Loving1.97

divorce1.96

fiance1.93

marry1.93

wife1.92

Counsel1.91

solicitor1.91

marriages1.85

Family1.84

Negative logits

meteor-2.22

-2.06

nit-2.03

ompl-1.90

patrolling-1.86

Beam-1.82

monary-1.81

velength-1.78

actionGroup-1.76

Nare-1.73

iencies-1.73

ion-1.73

looting-1.72

scaven-1.71

Cavern-1.70

pellets-1.70

bunker-1.69

KN-1.68

projectile-1.66

replay-1.66

INTERVAL 2.282 - 2.536
CONTAINS 0.001%

,
Token,
Feature activation+0.000
as
Token as
Feature activation+0.000
well
Token well
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
couple
Token couple
Feature activation+2.474
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
30
Token 30
Feature activation+0.000
-
Token-
Feature activation+0.000
ourt
Tokenourt
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
together
Token together
Feature activation+0.537
the
Token the
Feature activation+0.000
couple
Token couple
Feature activation+2.473
used
Token used
Feature activation+0.000
their
Token their
Feature activation+0.320
vast
Token vast
Feature activation+0.000
holdings
Token holdings
Feature activation+0.000
in
Token in
Feature activation+0.000
,
Token,
Feature activation+0.000
most
Token most
Feature activation+0.000
such
Token such
Feature activation+0.000
marriages
Token marriages
Feature activation+0.081
are
Token are
Feature activation+0.025
arranged
Token arranged
Feature activation+2.536
through
Token through
Feature activation+0.114
illegal
Token illegal
Feature activation+0.272
channels
Token channels
Feature activation+0.543
,
Token,
Feature activation+0.000
according
Token according
Feature activation+0.000
to
Token to
Feature activation+0.000
350
Token 350
Feature activation+0.064
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
couples
Token couples
Feature activation+2.220
splitting
Token splitting
Feature activation+2.294
in
Token in
Feature activation+0.365
a
Token a
Feature activation+0.522
year
Token year
Feature activation+0.250
since
Token since
Feature activation+0.000
2010
Token 2010
Feature activation+0.048
and
Token and
Feature activation+0.730
a
Token a
Feature activation+0.195
woman
Token woman
Feature activation+1.348
and
Token and
Feature activation+0.508
this
Token this
Feature activation+0.000
union
Token union
Feature activation+2.452
must
Token must
Feature activation+0.000
be
Token be
Feature activation+0.031
preserved
Token preserved
Feature activation+0.224
,"
Token,"
Feature activation+0.000
conc
Token conc
Feature activation+1.436

INTERVAL 2.029 - 2.282
CONTAINS 0.000%

leads
Token leads
Feature activation+0.000
to
Token to
Feature activation+0.000
350
Token 350
Feature activation+0.064
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
couples
Token couples
Feature activation+2.220
splitting
Token splitting
Feature activation+2.294
in
Token in
Feature activation+0.365
a
Token a
Feature activation+0.522
year
Token year
Feature activation+0.250
since
Token since
Feature activation+0.000
"
Token"
Feature activation+0.000
I
TokenI
Feature activation+0.000
believe
Token believe
Feature activation+0.000
marriage
Token marriage
Feature activation+0.000
is
Token is
Feature activation+0.000
between
Token between
Feature activation+2.281
a
Token a
Feature activation+0.074
man
Token man
Feature activation+1.238
and
Token and
Feature activation+0.730
a
Token a
Feature activation+0.195
woman
Token woman
Feature activation+1.348
Steve
Token Steve
Feature activation+0.000
Basil
Token Basil
Feature activation+0.000
one
Tokenone
Feature activation+0.000
,
Token,
Feature activation+0.000
recently
Token recently
Feature activation+0.000
filed
Token filed
Feature activation+2.095
a
Token a
Feature activation+0.113
joint
Token joint
Feature activation+1.713
petition
Token petition
Feature activation+1.449
for
Token for
Feature activation+0.000
divorce
Token divorce
Feature activation+1.166

INTERVAL 1.775 - 2.029
CONTAINS 0.001%

resulting
Token resulting
Feature activation+0.000
in
Token in
Feature activation+0.000
thousands
Token thousands
Feature activation+0.000
of
Token of
Feature activation+0.000
marriages
Token marriages
Feature activation+0.415
between
Token between
Feature activation+1.831
Vietnamese
Token Vietnamese
Feature activation+0.000
women
Token women
Feature activation+0.727
and
Token and
Feature activation+0.601
foreign
Token foreign
Feature activation+0.000
men
Token men
Feature activation+0.184
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
While
TokenWhile
Feature activation+0.000
there
Token there
Feature activation+0.000
are
Token are
Feature activation+0.000
some
Token some
Feature activation+0.000
couples
Token couples
Feature activation+1.830
who
Token who
Feature activation+0.384
can
Token can
Feature activation+0.000
pull
Token pull
Feature activation+0.321
off
Token off
Feature activation+0.000
weddings
Token weddings
Feature activation+1.569
April
Token April
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
couple
Token couple
Feature activation+1.930
sealed
Token sealed
Feature activation+0.051
their
Token their
Feature activation+0.729
marriage
Token marriage
Feature activation+0.589
with
Token with
Feature activation+1.236
corresponding
Token corresponding
Feature activation+0.298
only
Token only
Feature activation+0.000
route
Token route
Feature activation+0.000
to
Token to
Feature activation+0.000
having
Token having
Feature activation+1.210
a
Token a
Feature activation+0.108
baby
Token baby
Feature activation+1.821
is
Token is
Feature activation+0.000
international
Token international
Feature activation+0.000
surrog
Token surrog
Feature activation+1.342
acy
Tokenacy
Feature activation+0.000
.
Token.
Feature activation+0.000
in
Token in
Feature activation+0.000
Hong
Token Hong
Feature activation+0.000
Kong
Token Kong
Feature activation+0.000
are
Token are
Feature activation+0.000
valid
Token valid
Feature activation+0.000
relationships
Token relationships
Feature activation+1.818
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

INTERVAL 1.522 - 1.775
CONTAINS 0.001%

shows
Token shows
Feature activation+0.000
the
Token the
Feature activation+0.000
mob
Token mob
Feature activation+0.000
pulling
Token pulling
Feature activation+0.000
the
Token the
Feature activation+0.000
couple
Token couple
Feature activation+1.636
out
Token out
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
guest
Token guest
Feature activation+0.000
house
Token house
Feature activation+0.000
announcing
Token announcing
Feature activation+0.000
that
Token that
Feature activation+0.000
she
Token she
Feature activation+0.308
had
Token had
Feature activation+0.000
already
Token already
Feature activation+0.000
divorced
Token divorced
Feature activation+1.548
him
Token him
Feature activation+0.377
in
Token in
Feature activation+0.000
Ukraine
Token Ukraine
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
sl
Tokensl
Feature activation+0.000
ash
Tokenash
Feature activation+0.000
-
Token-
Feature activation+0.000
real
Tokenreal
Feature activation+0.000
married
Token married
Feature activation+0.000
couple
Token couple
Feature activation+1.638
Paul
Token Paul
Feature activation+0.000
Rust
Token Rust
Feature activation+0.000
and
Token and
Feature activation+0.000
Les
Token Les
Feature activation+0.000
ley
Tokenley
Feature activation+0.000
than
Token than
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
being
Token being
Feature activation+0.000
a
Token a
Feature activation+0.106
wedding
Token wedding
Feature activation+1.629
guest
Token guest
Feature activation+0.782
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.079
total
Token total
Feature activation+0.000
pain
Token pain
Feature activation+0.000
continue
Token continue
Feature activation+0.000
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.393
live
Token live
Feature activation+0.504
in
Token in
Feature activation+0.000
relationship
Token relationship
Feature activation+1.765
with
Token with
Feature activation+0.637
first
Token first
Feature activation+0.000
girl
Token girl
Feature activation+0.248
and
Token and
Feature activation+0.000
we
Token we
Feature activation+0.315

INTERVAL 1.268 - 1.522
CONTAINS 0.002%

lli
Tokenlli
Feature activation+0.000
and
Token and
Feature activation+0.000
her
Token her
Feature activation+0.120
husband
Token husband
Feature activation+0.721
of
Token of
Feature activation+0.000
two
Token two
Feature activation+1.310
years
Token years
Feature activation+0.000
,
Token,
Feature activation+0.000
Steve
Token Steve
Feature activation+0.000
Basil
Token Basil
Feature activation+0.000
one
Tokenone
Feature activation+0.000
's
Token's
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
interest
Token interest
Feature activation+0.522
of
Token of
Feature activation+0.000
both
Token both
Feature activation+1.285
the
Token the
Feature activation+0.000
UK
Token UK
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
EU
Token EU
Feature activation+0.000
over
Token over
Feature activation+0.000
her
Token her
Feature activation+0.790
money
Token money
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
divorce
Token divorce
Feature activation+1.427
courts
Token courts
Feature activation+1.555
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Uk
TokenUk
Feature activation+0.199
having
Token having
Feature activation+1.210
a
Token a
Feature activation+0.108
baby
Token baby
Feature activation+1.821
is
Token is
Feature activation+0.000
international
Token international
Feature activation+0.000
surrog
Token surrog
Feature activation+1.342
acy
Tokenacy
Feature activation+0.000
.
Token.
Feature activation+0.000
But
Token But
Feature activation+0.000
the
Token the
Feature activation+0.000
rapid
Token rapid
Feature activation+0.000
during
Token during
Feature activation+0.000
their
Token their
Feature activation+0.215
12
Token 12
Feature activation+0.000
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.000
relationship
Token relationship
Feature activation+1.270
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Before
TokenBefore
Feature activation+0.000
they
Token they
Feature activation+0.230

INTERVAL 1.014 - 1.268
CONTAINS 0.003%

my
Token my
Feature activation+0.271
love
Token love
Feature activation+1.105
and
Token and
Feature activation+0.000
then
Token then
Feature activation+0.000
have
Token have
Feature activation+0.123
sex
Token sex
Feature activation+1.266
with
Token with
Feature activation+0.346
other
Token other
Feature activation+0.000
people
Token people
Feature activation+0.512
behind
Token behind
Feature activation+0.019
their
Token their
Feature activation+0.254
coat
Token coat
Feature activation+0.000
"
Token"
Feature activation+0.000
by
Token by
Feature activation+0.000
keeping
Token keeping
Feature activation+0.000
her
Token her
Feature activation+0.187
husband
Token husband
Feature activation+1.140
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
dark
Token dark
Feature activation+0.000
about
Token about
Feature activation+0.000
the
Token the
Feature activation+0.000
filed
Token filed
Feature activation+2.095
a
Token a
Feature activation+0.113
joint
Token joint
Feature activation+1.713
petition
Token petition
Feature activation+1.449
for
Token for
Feature activation+0.000
divorce
Token divorce
Feature activation+1.166
according
Token according
Feature activation+0.211
to
Token to
Feature activation+0.000
TMZ
Token TMZ
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
believe
Token believe
Feature activation+0.000
marriage
Token marriage
Feature activation+0.000
is
Token is
Feature activation+0.000
between
Token between
Feature activation+2.281
a
Token a
Feature activation+0.074
man
Token man
Feature activation+1.238
and
Token and
Feature activation+0.730
a
Token a
Feature activation+0.195
woman
Token woman
Feature activation+1.348
and
Token and
Feature activation+0.508
this
Token this
Feature activation+0.000
(
Token (
Feature activation+0.000
well
Tokenwell
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
worst
Token worst
Feature activation+0.000
couples
Token couples
Feature activation+1.150
)
Token)
Feature activation+0.000
have
Token have
Feature activation+0.167
begun
Token begun
Feature activation+0.000
to
Token to
Feature activation+0.000
hire
Token hire
Feature activation+0.209

INTERVAL 0.761 - 1.014
CONTAINS 0.004%

Magazine
Token Magazine
Feature activation+0.000
ran
Token ran
Feature activation+0.000
the
Token the
Feature activation+0.000
spoiler
Token spoiler
Feature activation+0.000
a
Token a
Feature activation+0.000
couple
Token couple
Feature activation+0.998
of
Token of
Feature activation+0.000
weeks
Token weeks
Feature activation+0.000
ago
Token ago
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
in
Token in
Feature activation+0.000
office
Token office
Feature activation+0.000
who
Token who
Feature activation+0.242
fell
Token fell
Feature activation+0.541
in
Token in
Feature activation+0.000
love
Token love
Feature activation+0.868
with
Token with
Feature activation+0.717
me
Token me
Feature activation+0.256
and
Token and
Feature activation+0.110
I
Token I
Feature activation+0.242
was
Token was
Feature activation+0.096
manager
Token manager
Feature activation+0.000
married
Token married
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
beautiful
Token beautiful
Feature activation+0.000
woman
Token woman
Feature activation+0.812
his
Token his
Feature activation+0.000
own
Token own
Feature activation+0.000
age
Token age
Feature activation+0.000
,
Token,
Feature activation+0.000
with
Token with
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
was
Token was
Feature activation+0.000
rom
Token rom
Feature activation+0.000
antically
Tokenantically
Feature activation+1.015
involved
Token involved
Feature activation+0.873
with
Token with
Feature activation+0.410
Cobb
Token Cobb
Feature activation+0.000
before
Token before
Feature activation+0.000
they
Token they
Feature activation+0.583
stopped
Token stopped
Feature activation+0.114
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
He
TokenHe
Feature activation+0.050
said
Token said
Feature activation+0.000
that
Token that
Feature activation+0.000
Mrs
Token Mrs
Feature activation+0.774
Iv
Token Iv
Feature activation+0.000
lev
Tokenlev
Feature activation+0.000
a
Tokena
Feature activation+0.000
had
Token had
Feature activation+0.068
"
Token "
Feature activation+0.000

INTERVAL 0.507 - 0.761
CONTAINS 0.011%

in
Token in
Feature activation+0.000
jeopardy
Token jeopardy
Feature activation+0.000
by
Token by
Feature activation+0.000
talking
Token talking
Feature activation+0.000
to
Token to
Feature activation+0.000
Mrs
Token Mrs
Feature activation+0.510
.
Token.
Feature activation+0.000
Ph
Token Ph
Feature activation+0.000
ist
Tokenist
Feature activation+0.000
.
Token.
Feature activation+0.000
In
Token In
Feature activation+0.000
how
Token how
Feature activation+0.000
the
Token the
Feature activation+0.000
probe
Token probe
Feature activation+0.000
was
Token was
Feature activation+0.000
wed
Token wed
Feature activation+0.000
ged
Tokenged
Feature activation+0.601
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
shadowy
Token shadowy
Feature activation+0.000
crack
Token crack
Feature activation+0.000
.
Token.
Feature activation+0.000
sat
Token sat
Feature activation+0.000
in
Token in
Feature activation+0.000
our
Token our
Feature activation+0.000
cells
Token cells
Feature activation+0.000
for
Token for
Feature activation+0.000
two
Token two
Feature activation+0.524
months
Token months
Feature activation+0.000
before
Token before
Feature activation+0.000
our
Token our
Feature activation+0.000
families
Token families
Feature activation+0.000
had
Token had
Feature activation+0.028
Lauren
Token Lauren
Feature activation+0.000
More
Token More
Feature activation+0.000
lli
Tokenlli
Feature activation+0.000
and
Token and
Feature activation+0.000
her
Token her
Feature activation+0.120
husband
Token husband
Feature activation+0.721
of
Token of
Feature activation+0.000
two
Token two
Feature activation+1.310
years
Token years
Feature activation+0.000
,
Token,
Feature activation+0.000
Steve
Token Steve
Feature activation+0.000
is
Token is
Feature activation+0.000
to
Token to
Feature activation+0.000
destroy
Token destroy
Feature activation+0.000
my
Token my
Feature activation+0.000
spider
Token spider
Feature activation+0.000
bride
Token bride
Feature activation+0.650
with
Token with
Feature activation+0.000
fire
Token fire
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
sort
Token sort
Feature activation+0.000

INTERVAL 0.254 - 0.507
CONTAINS 0.020%

was
Token was
Feature activation+0.096
kind
Token kind
Feature activation+0.188
of
Token of
Feature activation+0.000
attracted
Token attracted
Feature activation+1.073
to
Token to
Feature activation+0.182
her
Token her
Feature activation+0.278
.
Token.
Feature activation+0.000
At
Token At
Feature activation+0.000
one
Token one
Feature activation+0.648
point
Token point
Feature activation+0.176
of
Token of
Feature activation+0.000
s
Token s
Feature activation+0.048
*
Token*
Feature activation+0.000
xual
Tokenxual
Feature activation+0.275
contacts
Token contacts
Feature activation+0.000
with
Token with
Feature activation+0.290
another
Token another
Feature activation+0.311
women
Token women
Feature activation+0.743
(
Token(
Feature activation+0.000
I
TokenI
Feature activation+0.137
never
Token never
Feature activation+0.000
had
Token had
Feature activation+0.463
role
Token role
Feature activation+0.000
.
Token.
Feature activation+0.000
When
Token When
Feature activation+0.000
this
Token this
Feature activation+0.000
relationship
Token relationship
Feature activation+0.000
between
Token between
Feature activation+0.286
Cam
Token Cam
Feature activation+0.000
illa
Tokenilla
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
director
Token director
Feature activation+0.000
not
Token not
Feature activation+0.000
identify
Token identify
Feature activation+0.000
as
Token as
Feature activation+0.000
heterosexual
Token heterosexual
Feature activation+0.000
white
Token white
Feature activation+0.000
men
Token men
Feature activation+0.307
âĢĶ
Token âĢĶ
Feature activation+0.000
will
Token will
Feature activation+0.000
not
Token not
Feature activation+0.000
be
Token be
Feature activation+0.000
able
Token able
Feature activation+0.000
the
Token the
Feature activation+0.000
UK
Token UK
Feature activation+0.000
after
Token after
Feature activation+0.050
marrying
Token marrying
Feature activation+1.181
a
Token a
Feature activation+0.192
cash
Token cash
Feature activation+0.365
-
Token-
Feature activation+0.000
stra
Tokenstra
Feature activation+0.000
pped
Tokenpped
Feature activation+0.000
Brit
Token Brit
Feature activation+0.000
on
Tokenon
Feature activation+0.000

INTERVAL 0.000 - 0.254
CONTAINS 99.958%

The
TokenThe
Feature activation+0.000
quantity
Token quantity
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
quality
Token quality
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
shark
Token shark
Feature activation+0.000
teeth
Token teeth
Feature activation+0.000
and
Token and
Feature activation+0.000
fossils
Token fossils
Feature activation+0.000
the
Token the
Feature activation+0.000
rite
Token rite
Feature activation+0.000
of
Token of
Feature activation+0.000
kind
Token kind
Feature activation+0.000
ling
Tokenling
Feature activation+0.000
and
Token and
Feature activation+0.000
kind
Token kind
Feature activation+0.000
ling
Tokenling
Feature activation+0.000
bon
Token bon
Feature activation+0.000
fires
Tokenfires
Feature activation+0.000
,
Token,
Feature activation+0.000
J
TokenJ
Feature activation+0.000
),
Token),
Feature activation+0.000
Brazil
Token Brazil
Feature activation+0.000
,
Token,
Feature activation+0.000
approved
Token approved
Feature activation+0.000
at
Token at
Feature activation+0.000
the
Token the
Feature activation+0.000
1
Token 1
Feature activation+0.000
st
Tokenst
Feature activation+0.000
FAR
Token FAR
Feature activation+0.000
J
TokenJ
Feature activation+0.000
React
Token React
Feature activation+0.000
Native
TokenNative
Feature activation+0.000
questions
Token questions
Feature activation+0.000
asked
Token asked
Feature activation+0.000
of
Token of
Feature activation+0.000
me
Token me
Feature activation+0.000
by
Token by
Feature activation+0.000
my
Token my
Feature activation+0.000
web
Token web
Feature activation+0.000
developer
Token developer
Feature activation+0.000
friends
Token friends
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
represent
Tokenrepresent
Feature activation+0.000
atives
Tokenatives
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
in
Token in
Feature activation+0.000
Washington
Token Washington
Feature activation+0.000
D
Token D
Feature activation+0.000
.
Token.
Feature activation+0.000
C
TokenC
Feature activation+0.000
.
Token.
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 4 in H1.6: (feature 6571

TOP ACTIVATIONS
MAX = 2.958

reason
Token reason
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
world
Token world
Feature activation+0.000
will
Token will
Feature activation+0.000
make
Token make
Feature activation+2.958
a
Token a
Feature activation+0.000
mother
Token mother
Feature activation+0.000
above
Token above
Feature activation+0.000
rep
Token rep
Feature activation+0.000
roach
Tokenroach
Feature activation+0.000
girl
Token girl
Feature activation+0.000
will
Token will
Feature activation+0.000
fall
Token fall
Feature activation+0.000
before
Token before
Feature activation+0.000
him
Token him
Feature activation+0.000
or
Token or
Feature activation+2.955
2
Token 2
Feature activation+0.000
)
Token)
Feature activation+0.000
he
Token he
Feature activation+0.000
will
Token will
Feature activation+0.000
fall
Token fall
Feature activation+0.000
borgh
Tokenborgh
Feature activation+0.000
ini
Tokenini
Feature activation+0.000
Count
Token Count
Feature activation+0.000
ach
Tokenach
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+2.641
a
Token a
Feature activation+0.000
Ferrari
Token Ferrari
Feature activation+0.000
512
Token 512
Feature activation+0.000
Berlin
Token Berlin
Feature activation+0.000
etta
Tokenetta
Feature activation+0.000
actually
Token actually
Feature activation+0.000
didn
Token didn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
or
Token or
Feature activation+2.297
think
Token think
Feature activation+0.000
we
Token we
Feature activation+0.000
made
Token made
Feature activation+1.708
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
User
Token User
Feature activation+0.000
has
Token has
Feature activation+0.000
following
Token following
Feature activation+0.000
choices
Token choices
Feature activation+0.000
to
Token to
Feature activation+0.127
make
Token make
Feature activation+2.296
to
Token to
Feature activation+0.107
do
Token do
Feature activation+0.000
the
Token the
Feature activation+0.000
deployment
Token deployment
Feature activation+0.000
.
Token.
Feature activation+0.000
go
Tokengo
Feature activation+0.000
sleeping
Token sleeping
Feature activation+0.000
that
Token that
Feature activation+0.108
night
Token night
Feature activation+0.000
and
Token and
Feature activation+0.000
make
Token make
Feature activation+2.179
it
Token it
Feature activation+0.000
happen
Token happen
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.089
did
Token did
Feature activation+0.000
choice
Token choice
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+0.000
yours
Token yours
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+2.037
.
Token.
Feature activation+0.000
A
Token A
Feature activation+0.000
new
Token new
Feature activation+0.000
framework
Token framework
Feature activation+0.000
for
Token for
Feature activation+0.000
had
Token had
Feature activation+0.000
little
Token little
Feature activation+0.000
choice
Token choice
Feature activation+0.000
but
Token but
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+2.001
the
Token the
Feature activation+0.000
annual
Token annual
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
doc
Tokendoc
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
lifestyle
Token lifestyle
Feature activation+0.000
choices
Token choices
Feature activation+0.034
they
Token they
Feature activation+0.000
make
Token make
Feature activation+1.941
.[
Token.[
Feature activation+0.000
1
Token1
Feature activation+0.000
]
Token]
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
choices
Token choices
Feature activation+0.244
she
Token she
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
made
Token made
Feature activation+1.894
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
people
Token people
Feature activation+0.000
she
Token she
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
that
Token that
Feature activation+0.000
kind
Token kind
Feature activation+0.029
of
Token of
Feature activation+0.000
publicity
Token publicity
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+1.837
wait
Token wait
Feature activation+0.000
for
Token for
Feature activation+0.000
him
Token him
Feature activation+0.000
to
Token to
Feature activation+0.000
wake
Token wake
Feature activation+0.000
the
Token the
Feature activation+0.000
situation
Token situation
Feature activation+0.000
and
Token and
Feature activation+0.000
continues
Token continues
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+1.711
the
Token the
Feature activation+0.000
impossible
Token impossible
Feature activation+0.000
decision
Token decision
Feature activation+0.000
when
Token when
Feature activation+0.000
necessary
Token necessary
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
or
Token or
Feature activation+2.297
think
Token think
Feature activation+0.000
we
Token we
Feature activation+0.000
made
Token made
Feature activation+1.708
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
choice
Token choice
Feature activation+0.000
than
Token than
Feature activation+0.247
we
Token we
Feature activation+0.000
to
Token to
Feature activation+0.000
stay
Token stay
Feature activation+0.000
the
Token the
Feature activation+0.000
course
Token course
Feature activation+0.000
with
Token with
Feature activation+0.000
Or
Token Or
Feature activation+1.629
ton
Tokenton
Feature activation+0.000
next
Token next
Feature activation+0.000
season
Token season
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
never
Token never
Feature activation+0.000
played
Token played
Feature activation+0.000
with
Token with
Feature activation+0.000
baby
Token baby
Feature activation+0.000
dolls
Token dolls
Feature activation+0.000
or
Token or
Feature activation+1.577
played
Token played
Feature activation+0.000
'
Token '
Feature activation+0.000
Mom
TokenMom
Feature activation+0.000
my
Tokenmy
Feature activation+0.000
.
Token.
Feature activation+0.000
preference
Token preference
Feature activation+0.448
for
Token for
Feature activation+0.941
a
Token a
Feature activation+0.000
smaller
Token smaller
Feature activation+0.000
family
Token family
Feature activation+0.000
over
Token over
Feature activation+1.565
a
Token a
Feature activation+0.000
larger
Token larger
Feature activation+0.000
one
Token one
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
that
Token that
Feature activation+0.000
's
Token's
Feature activation+0.000
why
Token why
Feature activation+0.000
we
Token we
Feature activation+0.000
cho
Token cho
Feature activation+0.000
osed
Tokenosed
Feature activation+1.556
whale
Token whale
Feature activation+0.000
penis
Token penis
Feature activation+0.000
le
Token le
Feature activation+0.000
ath
Tokenath
Feature activation+0.000
ure
Tokenure
Feature activation+0.000
going
Token going
Feature activation+0.000
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.000
to
Token to
Feature activation+0.000
choose
Token choose
Feature activation+0.000
between
Token between
Feature activation+1.555
the
Token the
Feature activation+0.000
G
Token G
Feature activation+0.000
3
Token3
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
's
Token's
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Cho
TokenCho
Feature activation+0.000
osing
Tokenosing
Feature activation+1.528
a
Token a
Feature activation+0.000
cabinet
Token cabinet
Feature activation+0.000
is
Token is
Feature activation+0.000
just
Token just
Feature activation+0.000
one
Token one
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
G
TokenG
Feature activation+0.000
athering
Tokenathering
Feature activation+0.000
information
Token information
Feature activation+0.000
before
Token before
Feature activation+0.000
making
Token making
Feature activation+1.479
a
Token a
Feature activation+0.000
decision
Token decision
Feature activation+0.000
has
Token has
Feature activation+0.000
been
Token been
Feature activation+0.000
considered
Token considered
Feature activation+0.560

Top DFA by src position
MAX = 4.109

or
Token or
Feature activation+0.020
Top resid features:
feel
Token feel
Feature activation+0.033
Top resid features:
better
Token better
Feature activation+0.062
Top resid features:
about
Token about
Feature activation+0.022
Top resid features:
your
Token your
Feature activation+0.053
Top resid features:
choices
Token choices
Feature activation+3.790
Top resid features:
by
Token by
Feature activation+0.012
Top resid features:
questioning
Token questioning
Feature activation+0.079
Top resid features:
those
Token those
Feature activation+0.035
Top resid features:
of
Token of
Feature activation-0.011
Top resid features:
others
Token others
Feature activation+0.051
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.107
Top resid features:
is
Token is
Feature activation+0.046
Top resid features:
given
Token given
Feature activation+0.044
Top resid features:
another
Token another
Feature activation+0.069
Top resid features:
choice
Token choice
Feature activation+4.109
Top resid features:
:
Token:
Feature activation+0.035
Top resid features:
1
Token 1
Feature activation+0.052
Top resid features:
)
Token)
Feature activation+0.019
Top resid features:
A
Token A
Feature activation+0.028
Top resid features:
beautiful
Token beautiful
Feature activation+0.034
Top resid features:
I
TokenI
Feature activation+0.010
Top resid features:
think
Token think
Feature activation+0.020
Top resid features:
you
Token you
Feature activation+0.016
Top resid features:
had
Token had
Feature activation-0.006
Top resid features:
a
Token a
Feature activation+0.012
Top resid features:
choice
Token choice
Feature activation+3.581
Top resid features:
of
Token of
Feature activation-0.003
Top resid features:
three
Token three
Feature activation+0.004
Top resid features:
posters
Token posters
Feature activation+0.013
Top resid features:
on
Token on
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.013
Top resid features:
to
Token to
Feature activation-0.004
Top resid features:
think
Token think
Feature activation+0.027
Top resid features:
we
Token we
Feature activation+0.020
Top resid features:
made
Token made
Feature activation+0.023
Top resid features:
a
Token a
Feature activation+0.018
Top resid features:
choice
Token choice
Feature activation+3.210
Top resid features:
when
Token when
Feature activation-0.006
Top resid features:
we
Token we
Feature activation+0.034
Top resid features:
actually
Token actually
Feature activation+0.007
Top resid features:
didn
Token didn
Feature activation+0.016
Top resid features:
âĢ
TokenâĢ
Feature activation-0.080
Top resid features:
services
Token services
Feature activation+0.055
Top resid features:
.
Token.
Feature activation-0.016
Top resid features:
User
Token User
Feature activation+0.066
Top resid features:
has
Token has
Feature activation+0.009
Top resid features:
following
Token following
Feature activation+0.029
Top resid features:
choices
Token choices
Feature activation+3.453
Top resid features:
to
Token to
Feature activation+0.029
Top resid features:
make
Token make
Feature activation+0.053
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
do
Token do
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.025
Top resid features:
with
Token with
Feature activation+0.124
Top resid features:
only
Token only
Feature activation+0.074
Top resid features:
one
Token one
Feature activation+0.072
Top resid features:
choice
Token choice
Feature activation+2.970
Top resid features:
:
Token:
Feature activation+0.043
Top resid features:
For
Token For
Feature activation+0.032
Top resid features:
go
Tokengo
Feature activation+0.187
Top resid features:
sleeping
Token sleeping
Feature activation+0.114
Top resid features:
that
Token that
Feature activation+0.100
Top resid features:
O
Token O
Feature activation+0.008
Top resid features:
PP
TokenPP
Feature activation+0.014
Top resid features:
T
TokenT
Feature activation+0.011
Top resid features:
)
Token)
Feature activation-0.027
Top resid features:
the
Token the
Feature activation+0.028
Top resid features:
choice
Token choice
Feature activation+3.399
Top resid features:
is
Token is
Feature activation-0.040
Top resid features:
now
Token now
Feature activation-0.015
Top resid features:
yours
Token yours
Feature activation+0.049
Top resid features:
to
Token to
Feature activation+0.071
Top resid features:
make
Token make
Feature activation+0.083
Top resid features:
physician
Token physician
Feature activation+0.007
Top resid features:
lobby
Token lobby
Feature activation-0.012
Top resid features:
has
Token has
Feature activation+0.001
Top resid features:
had
Token had
Feature activation-0.006
Top resid features:
little
Token little
Feature activation+0.042
Top resid features:
choice
Token choice
Feature activation+2.990
Top resid features:
but
Token but
Feature activation-0.028
Top resid features:
to
Token to
Feature activation+0.137
Top resid features:
make
Token make
Feature activation+0.042
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
annual
Token annual
Feature activation+0.000
Top resid features:
,
Token,
Feature activation-0.007
Top resid features:
but
Token but
Feature activation+0.041
Top resid features:
by
Token by
Feature activation+0.042
Top resid features:
the
Token the
Feature activation+0.105
Top resid features:
lifestyle
Token lifestyle
Feature activation+0.254
Top resid features:
choices
Token choices
Feature activation+2.749
Top resid features:
they
Token they
Feature activation+0.172
Top resid features:
make
Token make
Feature activation-0.170
Top resid features:
.[
Token.[
Feature activation+0.000
Top resid features:
1
Token1
Feature activation+0.000
Top resid features:
]
Token]
Feature activation+0.000
Top resid features:
Taylor
Token Taylor
Feature activation-0.021
Top resid features:
)
Token)
Feature activation-0.006
Top resid features:
lives
Token lives
Feature activation+0.069
Top resid features:
with
Token with
Feature activation+0.013
Top resid features:
the
Token the
Feature activation+0.041
Top resid features:
choices
Token choices
Feature activation+3.172
Top resid features:
she
Token she
Feature activation+0.019
Top resid features:
âĢ
TokenâĢ
Feature activation-0.083
Top resid features:
Ļ
TokenĻ
Feature activation+0.020
Top resid features:
s
Tokens
Feature activation-0.022
Top resid features:
made
Token made
Feature activation-0.130
Top resid features:
,
Token,
Feature activation-0.006
Top resid features:
so
Token so
Feature activation+0.004
Top resid features:
they
Token they
Feature activation+0.010
Top resid features:
had
Token had
Feature activation-0.002
Top resid features:
a
Token a
Feature activation+0.012
Top resid features:
choice
Token choice
Feature activation+2.346
Top resid features:
:
Token:
Feature activation+0.002
Top resid features:
either
Token either
Feature activation+0.493
Top resid features:
call
Token call
Feature activation+0.014
Top resid features:
the
Token the
Feature activation+0.023
Top resid features:
authorities
Token authorities
Feature activation-0.008
Top resid features:
Taylor
Token Taylor
Feature activation-0.005
Top resid features:
)
Token)
Feature activation+0.008
Top resid features:
lives
Token lives
Feature activation+0.011
Top resid features:
with
Token with
Feature activation+0.015
Top resid features:
the
Token the
Feature activation+0.024
Top resid features:
choices
Token choices
Feature activation+2.398
Top resid features:
she
Token she
Feature activation-0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.005
Top resid features:
Ļ
TokenĻ
Feature activation-0.001
Top resid features:
s
Tokens
Feature activation+0.010
Top resid features:
made
Token made
Feature activation-0.012
Top resid features:
to
Token to
Feature activation+0.002
Top resid features:
think
Token think
Feature activation+0.027
Top resid features:
we
Token we
Feature activation+0.015
Top resid features:
made
Token made
Feature activation+0.062
Top resid features:
a
Token a
Feature activation+0.015
Top resid features:
choice
Token choice
Feature activation+2.595
Top resid features:
when
Token when
Feature activation-0.012
Top resid features:
we
Token we
Feature activation+0.020
Top resid features:
actually
Token actually
Feature activation+0.001
Top resid features:
didn
Token didn
Feature activation+0.005
Top resid features:
âĢ
TokenâĢ
Feature activation-0.032
Top resid features:
suggests
Token suggests
Feature activation+0.066
Top resid features:
the
Token the
Feature activation+0.004
Top resid features:
Bills
Token Bills
Feature activation-0.050
Top resid features:
have
Token have
Feature activation+0.004
Top resid features:
little
Token little
Feature activation+0.024
Top resid features:
choice
Token choice
Feature activation+2.453
Top resid features:
but
Token but
Feature activation-0.013
Top resid features:
to
Token to
Feature activation-0.005
Top resid features:
stay
Token stay
Feature activation+0.034
Top resid features:
the
Token the
Feature activation+0.043
Top resid features:
course
Token course
Feature activation-0.005
Top resid features:
-
Token-
Feature activation+0.001
Top resid features:
free
Tokenfree
Feature activation+0.006
Top resid features:
is
Token is
Feature activation+0.006
Top resid features:
a
Token a
Feature activation+0.017
Top resid features:
conscious
Token conscious
Feature activation+0.005
Top resid features:
choice
Token choice
Feature activation+2.364
Top resid features:
Ċ
TokenĊ
Feature activation+0.005
Top resid features:
Ċ
TokenĊ
Feature activation+0.005
Top resid features:
Image
TokenImage
Feature activation-0.004
Top resid features:
:
Token:
Feature activation-0.007
Top resid features:
G
Token G
Feature activation-0.003
Top resid features:
reflecting
Token reflecting
Feature activation+0.013
Top resid features:
a
Token a
Feature activation+0.001
Top resid features:
decades
Token decades
Feature activation+0.030
Top resid features:
-
Token-
Feature activation+0.002
Top resid features:
long
Tokenlong
Feature activation+0.034
Top resid features:
preference
Token preference
Feature activation+2.489
Top resid features:
for
Token for
Feature activation-0.004
Top resid features:
a
Token a
Feature activation-0.019
Top resid features:
smaller
Token smaller
Feature activation+0.035
Top resid features:
family
Token family
Feature activation+0.076
Top resid features:
over
Token over
Feature activation-0.051
Top resid features:
and
Token and
Feature activation-0.012
Top resid features:
that
Token that
Feature activation-0.016
Top resid features:
's
Token's
Feature activation-0.022
Top resid features:
why
Token why
Feature activation-0.025
Top resid features:
we
Token we
Feature activation-0.043
Top resid features:
cho
Token cho
Feature activation+2.988
Top resid features:
osed
Tokenosed
Feature activation-0.004
Top resid features:
whale
Token whale
Feature activation+0.000
Top resid features:
penis
Token penis
Feature activation+0.000
Top resid features:
le
Token le
Feature activation+0.000
Top resid features:
ath
Tokenath
Feature activation+0.000
Top resid features:
are
Token are
Feature activation+0.001
Top resid features:
going
Token going
Feature activation+0.002
Top resid features:
to
Token to
Feature activation+0.006
Top resid features:
have
Token have
Feature activation+0.001
Top resid features:
to
Token to
Feature activation-0.004
Top resid features:
choose
Token choose
Feature activation+2.142
Top resid features:
between
Token between
Feature activation+0.113
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
G
Token G
Feature activation+0.000
Top resid features:
3
Token3
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
Harper
Token Harper
Feature activation+0.011
Top resid features:
's
Token's
Feature activation-0.009
Top resid features:
.
Token.
Feature activation-0.016
Top resid features:
Ċ
TokenĊ
Feature activation+0.010
Top resid features:
Ċ
TokenĊ
Feature activation-0.005
Top resid features:
Cho
TokenCho
Feature activation+3.017
Top resid features:
osing
Tokenosing
Feature activation-0.067
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
cabinet
Token cabinet
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
just
Token just
Feature activation+0.000
Top resid features:
information
Token information
Feature activation+0.031
Top resid features:
before
Token before
Feature activation+0.014
Top resid features:
committing
Token committing
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.012
Top resid features:
a
Token a
Feature activation+0.035
Top resid features:
choice
Token choice
Feature activation+2.669
Top resid features:
.
Token.
Feature activation-0.031
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.015
Top resid features:
G
TokenG
Feature activation+0.038
Top resid features:
athering
Tokenathering
Feature activation+0.008
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.06

Head 2: 0.04

Head 3: 0.02

Head 4: 0.04

Head 5: 0.05

Head 6: 0.54

Head 7: 0.05

Head 8: 0.02

Head 9: 0.02

Head 10: 0.03

Head 11: 0.06

Positive logits

choices3.04

choice2.73

selections2.44

options2.44

selection2.31

CHO2.24

option2.17

Choice2.17

alternatives2.15

chose2.10

preferences2.09

choosing2.02

utilitarian2.00

choice1.96

Option1.96

customization1.95

convenience1.94

simplicity1.93

pairing1.93

selecting1.91

Negative logits

bad-2.02

monton-2.01

forestation-1.97

dig-1.92

worm-1.90

trace-1.86

auri-1.86

dro-1.80

recorded-1.78

awar-1.76

adal-1.76

ujah-1.75

anus-1.75

oen-1.74

wave-1.73

surface-1.73

deep-1.72

ongo-1.72

gur-1.71

heres-1.71

INTERVAL 2.662 - 2.958
CONTAINS 0.000%

girl
Token girl
Feature activation+0.000
will
Token will
Feature activation+0.000
fall
Token fall
Feature activation+0.000
before
Token before
Feature activation+0.000
him
Token him
Feature activation+0.000
or
Token or
Feature activation+2.955
2
Token 2
Feature activation+0.000
)
Token)
Feature activation+0.000
he
Token he
Feature activation+0.000
will
Token will
Feature activation+0.000
fall
Token fall
Feature activation+0.000
reason
Token reason
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
world
Token world
Feature activation+0.000
will
Token will
Feature activation+0.000
make
Token make
Feature activation+2.958
a
Token a
Feature activation+0.000
mother
Token mother
Feature activation+0.000
above
Token above
Feature activation+0.000
rep
Token rep
Feature activation+0.000
roach
Tokenroach
Feature activation+0.000

INTERVAL 2.366 - 2.662
CONTAINS 0.000%

borgh
Tokenborgh
Feature activation+0.000
ini
Tokenini
Feature activation+0.000
Count
Token Count
Feature activation+0.000
ach
Tokenach
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+2.641
a
Token a
Feature activation+0.000
Ferrari
Token Ferrari
Feature activation+0.000
512
Token 512
Feature activation+0.000
Berlin
Token Berlin
Feature activation+0.000
etta
Tokenetta
Feature activation+0.000

INTERVAL 2.071 - 2.366
CONTAINS 0.000%

actually
Token actually
Feature activation+0.000
didn
Token didn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
or
Token or
Feature activation+2.297
think
Token think
Feature activation+0.000
we
Token we
Feature activation+0.000
made
Token made
Feature activation+1.708
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
go
Tokengo
Feature activation+0.000
sleeping
Token sleeping
Feature activation+0.000
that
Token that
Feature activation+0.108
night
Token night
Feature activation+0.000
and
Token and
Feature activation+0.000
make
Token make
Feature activation+2.179
it
Token it
Feature activation+0.000
happen
Token happen
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.089
did
Token did
Feature activation+0.000
User
Token User
Feature activation+0.000
has
Token has
Feature activation+0.000
following
Token following
Feature activation+0.000
choices
Token choices
Feature activation+0.000
to
Token to
Feature activation+0.127
make
Token make
Feature activation+2.296
to
Token to
Feature activation+0.107
do
Token do
Feature activation+0.000
the
Token the
Feature activation+0.000
deployment
Token deployment
Feature activation+0.000
.
Token.
Feature activation+0.000

INTERVAL 1.775 - 2.071
CONTAINS 0.001%

by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
lifestyle
Token lifestyle
Feature activation+0.000
choices
Token choices
Feature activation+0.034
they
Token they
Feature activation+0.000
make
Token make
Feature activation+1.941
.[
Token.[
Feature activation+0.000
1
Token1
Feature activation+0.000
]
Token]
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
choice
Token choice
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+0.000
yours
Token yours
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+2.037
.
Token.
Feature activation+0.000
A
Token A
Feature activation+0.000
new
Token new
Feature activation+0.000
framework
Token framework
Feature activation+0.000
for
Token for
Feature activation+0.000
that
Token that
Feature activation+0.000
kind
Token kind
Feature activation+0.029
of
Token of
Feature activation+0.000
publicity
Token publicity
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+1.837
wait
Token wait
Feature activation+0.000
for
Token for
Feature activation+0.000
him
Token him
Feature activation+0.000
to
Token to
Feature activation+0.000
wake
Token wake
Feature activation+0.000
choices
Token choices
Feature activation+0.244
she
Token she
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
made
Token made
Feature activation+1.894
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
people
Token people
Feature activation+0.000
she
Token she
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
had
Token had
Feature activation+0.000
little
Token little
Feature activation+0.000
choice
Token choice
Feature activation+0.000
but
Token but
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+2.001
the
Token the
Feature activation+0.000
annual
Token annual
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
doc
Tokendoc
Feature activation+0.000

INTERVAL 1.479 - 1.775
CONTAINS 0.001%

never
Token never
Feature activation+0.000
played
Token played
Feature activation+0.000
with
Token with
Feature activation+0.000
baby
Token baby
Feature activation+0.000
dolls
Token dolls
Feature activation+0.000
or
Token or
Feature activation+1.577
played
Token played
Feature activation+0.000
'
Token '
Feature activation+0.000
Mom
TokenMom
Feature activation+0.000
my
Tokenmy
Feature activation+0.000
.
Token.
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
or
Token or
Feature activation+2.297
think
Token think
Feature activation+0.000
we
Token we
Feature activation+0.000
made
Token made
Feature activation+1.708
a
Token a
Feature activation+0.000
different
Token different
Feature activation+0.000
choice
Token choice
Feature activation+0.000
than
Token than
Feature activation+0.247
we
Token we
Feature activation+0.000
the
Token the
Feature activation+0.000
situation
Token situation
Feature activation+0.000
and
Token and
Feature activation+0.000
continues
Token continues
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+1.711
the
Token the
Feature activation+0.000
impossible
Token impossible
Feature activation+0.000
decision
Token decision
Feature activation+0.000
when
Token when
Feature activation+0.000
necessary
Token necessary
Feature activation+0.000
going
Token going
Feature activation+0.000
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.000
to
Token to
Feature activation+0.000
choose
Token choose
Feature activation+0.000
between
Token between
Feature activation+1.555
the
Token the
Feature activation+0.000
G
Token G
Feature activation+0.000
3
Token3
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
preference
Token preference
Feature activation+0.448
for
Token for
Feature activation+0.941
a
Token a
Feature activation+0.000
smaller
Token smaller
Feature activation+0.000
family
Token family
Feature activation+0.000
over
Token over
Feature activation+1.565
a
Token a
Feature activation+0.000
larger
Token larger
Feature activation+0.000
one
Token one
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

INTERVAL 1.183 - 1.479
CONTAINS 0.001%

preferring
Token preferring
Feature activation+0.000
studies
Token studies
Feature activation+0.000
that
Token that
Feature activation+0.000
use
Token use
Feature activation+0.000
matching
Token matching
Feature activation+0.000
over
Token over
Feature activation+1.285
studies
Token studies
Feature activation+0.000
that
Token that
Feature activation+0.000
do
Token do
Feature activation+0.000
not
Token not
Feature activation+0.000
.
Token.
Feature activation+0.000
campaign
Token campaign
Feature activation+0.000
.
Token.
Feature activation+0.000
Asked
Token Asked
Feature activation+0.000
to
Token to
Feature activation+0.000
choose
Token choose
Feature activation+0.000
between
Token between
Feature activation+1.294
the
Token the
Feature activation+0.000
opposing
Token opposing
Feature activation+0.000
statements
Token statements
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
It
Token It
Feature activation+0.000
's
Token's
Feature activation+0.000
time
Token time
Feature activation+0.000
to
Token to
Feature activation+0.000
choose
Token choose
Feature activation+0.079
between
Token between
Feature activation+1.321
two
Token two
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
hottest
Token hottest
Feature activation+0.000
Android
Token Android
Feature activation+0.000
a
Token a
Feature activation+0.000
target
Token target
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
then
Token then
Feature activation+0.000
over
Token over
Feature activation+1.207
time
Token time
Feature activation+0.000
slowly
Token slowly
Feature activation+0.000
start
Token start
Feature activation+0.000
infiltr
Token infiltr
Feature activation+0.000
ating
Tokenating
Feature activation+0.000
more
Token more
Feature activation+0.000
likely
Token likely
Feature activation+0.000
to
Token to
Feature activation+0.000
choose
Token choose
Feature activation+0.000
abortion
Token abortion
Feature activation+0.000
over
Token over
Feature activation+1.470
white
Token white
Feature activation+0.000
women
Token women
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

INTERVAL 0.887 - 1.183
CONTAINS 0.001%

when
Token when
Feature activation+0.000
there
Token there
Feature activation+0.000
was
Token was
Feature activation+0.000
no
Token no
Feature activation+0.000
other
Token other
Feature activation+0.000
choice
Token choice
Feature activation+1.049
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Not
TokenNot
Feature activation+0.000
necessarily
Token necessarily
Feature activation+0.000
in
Token in
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
then
Token then
Feature activation+0.000
select
Token select
Feature activation+0.492
from
Token from
Feature activation+1.082
a
Token a
Feature activation+0.000
list
Token list
Feature activation+0.000
of
Token of
Feature activation+0.000
somewhat
Token somewhat
Feature activation+0.000
vague
Token vague
Feature activation+0.000
why
Tokenwhy
Feature activation+0.000
did
Token did
Feature activation+0.000
you
Token you
Feature activation+0.000
choose
Token choose
Feature activation+0.000
acting
Token acting
Feature activation+0.000
over
Token over
Feature activation+1.042
sports
Token sports
Feature activation+0.000
and
Token and
Feature activation+0.000
music
Token music
Feature activation+0.000
?
Token?
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
I
Token I
Feature activation+0.000
chose
Token chose
Feature activation+0.000
two
Token two
Feature activation+0.000
different
Token different
Feature activation+0.000
select
Token select
Feature activation+0.000
able
Tokenable
Feature activation+0.992
methods
Token methods
Feature activation+0.000
for
Token for
Feature activation+0.148
this
Token this
Feature activation+0.000
.
Token.
Feature activation+0.000
First
Token First
Feature activation+0.000
40
Token 40
Feature activation+0.000
sites
Token sites
Feature activation+0.000
of
Token of
Feature activation+0.000
their
Token their
Feature activation+0.000
choosing
Token choosing
Feature activation+0.023
between
Token between
Feature activation+1.033
Cant
Token Cant
Feature activation+0.000
ara
Tokenara
Feature activation+0.000
and
Token and
Feature activation+0.000
Dog
Token Dog
Feature activation+0.000
Creek
Token Creek
Feature activation+0.000

INTERVAL 0.592 - 0.887
CONTAINS 0.003%

would
Token would
Feature activation+0.000
have
Token have
Feature activation+0.000
been
Token been
Feature activation+0.000
the
Token the
Feature activation+0.000
ideal
Token ideal
Feature activation+0.000
choice
Token choice
Feature activation+0.647
for
Token for
Feature activation+0.000
Cincinnati
Token Cincinnati
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
with
Token with
Feature activation+0.000
memoir
Token memoir
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
Hard
TokenHard
Feature activation+0.000
Cho
Token Cho
Feature activation+0.000
ices
Tokenices
Feature activation+0.778
,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
described
Token described
Feature activation+0.000
Mr
Token Mr
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.000
a
Token a
Feature activation+0.000
choice
Token choice
Feature activation+0.700
.
Token.
Feature activation+0.000
Which
Token Which
Feature activation+0.837
side
Token side
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
table
Token table
Feature activation+0.000
are
Token are
Feature activation+0.000
objects
Token objects
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
hope
Token hope
Feature activation+0.000
you
Token you
Feature activation+0.000
picked
Token picked
Feature activation+0.665
the
Token the
Feature activation+0.000
right
Token right
Feature activation+0.000
one
Token one
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
need
Token need
Feature activation+0.000
to
Token to
Feature activation+0.000
change
Token change
Feature activation+0.000
your
Token your
Feature activation+0.000
pledge
Token pledge
Feature activation+0.000
or
Token or
Feature activation+0.621
up
Token up
Feature activation+0.000
your
Token your
Feature activation+0.000
pledge
Token pledge
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000

INTERVAL 0.296 - 0.592
CONTAINS 0.009%

will
Token will
Feature activation+0.000
select
Token select
Feature activation+0.000
the
Token the
Feature activation+0.000
proper
Token proper
Feature activation+0.000
driver
Token driver
Feature activation+0.000
from
Token from
Feature activation+0.319
the
Token the
Feature activation+0.000
list
Token list
Feature activation+0.000
of
Token of
Feature activation+0.000
installed
Token installed
Feature activation+0.000
drivers
Token drivers
Feature activation+0.000
margin
Token margin
Feature activation+0.000
of
Token of
Feature activation+0.000
support
Token support
Feature activation+0.000
for
Token for
Feature activation+0.000
Clinton
Token Clinton
Feature activation+0.000
over
Token over
Feature activation+0.355
Trump
Token Trump
Feature activation+0.000
is
Token is
Feature activation+0.000
still
Token still
Feature activation+0.000
13
Token 13
Feature activation+0.000
points
Token points
Feature activation+0.000
know
Token know
Feature activation+0.000
hundreds
Token hundreds
Feature activation+0.000
of
Token of
Feature activation+0.000
Christians
Token Christians
Feature activation+0.000
who
Token who
Feature activation+0.000
would
Token would
Feature activation+0.308
open
Token open
Feature activation+0.000
the
Token the
Feature activation+0.000
same
Token same
Feature activation+0.000
Bible
Token Bible
Feature activation+0.000
you
Token you
Feature activation+0.000
not
Token not
Feature activation+0.000
be
Token be
Feature activation+0.000
the
Token the
Feature activation+0.000
worst
Token worst
Feature activation+0.000
apparel
Token apparel
Feature activation+0.000
decision
Token decision
Feature activation+0.329
ever
Token ever
Feature activation+0.000
but
Token but
Feature activation+0.000
it
Token it
Feature activation+0.000
most
Token most
Feature activation+0.000
certainly
Token certainly
Feature activation+0.000
decisions
Token decisions
Feature activation+0.000
will
Token will
Feature activation+0.000
have
Token have
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
made
Token made
Feature activation+0.415
.
Token.
Feature activation+0.000
Some
Token Some
Feature activation+0.000
players
Token players
Feature activation+0.000
won
Token won
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000

INTERVAL 0.000 - 0.296
CONTAINS 99.983%

reports
Token reports
Feature activation+0.000
may
Token may
Feature activation+0.000
be
Token be
Feature activation+0.000
believed
Token believed
Feature activation+0.000
that
Token that
Feature activation+0.000
surveillance
Token surveillance
Feature activation+0.000
was
Token was
Feature activation+0.000
indeed
Token indeed
Feature activation+0.000
undertaken
Token undertaken
Feature activation+0.000
against
Token against
Feature activation+0.000
me
Token me
Feature activation+0.000
correct
Token correct
Feature activation+0.000
society
Token society
Feature activation+0.000
these
Token these
Feature activation+0.000
days
Token days
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
everybody
Token everybody
Feature activation+0.000
's
Token's
Feature activation+0.000
watching
Token watching
Feature activation+0.000
the
Token the
Feature activation+0.000
words
Token words
Feature activation+0.000
oa
Tokenoa
Feature activation+0.000
is
Tokenis
Feature activation+0.000
e
Token e
Feature activation+0.000
ambient
Token ambient
Feature activation+0.000
ais
Tokenais
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
,
Token,
Feature activation+0.000
av
Token av
Feature activation+0.000
alia
Tokenalia
Feature activation+0.000
.
Token.
Feature activation+0.000
â̦
Token â̦
Feature activation+0.000
If
Token If
Feature activation+0.000
there
Token there
Feature activation+0.000
are
Token are
Feature activation+0.000
good
Token good
Feature activation+0.000
ideas
Token ideas
Feature activation+0.000
that
Token that
Feature activation+0.000
Sen
Token Sen
Feature activation+0.000
.
Token.
Feature activation+0.000
Cruz
Token Cruz
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
House
Token House
Feature activation+0.000
âĢĶ
TokenâĢĶ
Feature activation+0.000
inspired
Tokeninspired
Feature activation+0.000
and
Token and
Feature activation+0.000
encouraged
Token encouraged
Feature activation+0.000
by
Token by
Feature activation+0.000
Cruz
Token Cruz
Feature activation+0.000
âĢĶ
TokenâĢĶ
Feature activation+0.000
re
Tokenre
Feature activation+0.000
vol
Tokenvol
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 5 in H1.6: (feature 14559

TOP ACTIVATIONS
MAX = 3.850

countries
Token countries
Feature activation+2.656
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
50
Token 50
Feature activation+0.000
largest
Token largest
Feature activation+0.000
camps
Token camps
Feature activation+3.850
,
Token,
Feature activation+0.000
featured
Token featured
Feature activation+0.000
on
Token on
Feature activation+0.228
the
Token the
Feature activation+0.000
above
Token above
Feature activation+0.000
that
Token that
Feature activation+0.000
went
Token went
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.652
in
Token in
Feature activation+0.000
Liberia
Token Liberia
Feature activation+0.602
in
Token in
Feature activation+0.000
May
Token May
Feature activation+0.000
was
Token was
Feature activation+0.000
multi
Token multi
Feature activation+0.357
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.745
commitments
Token commitments
Feature activation+0.402
to
Token to
Feature activation+0.427
res
Token res
Feature activation+3.569
ettle
Tokenettle
Feature activation+2.873
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+2.646
Erit
Token Erit
Feature activation+1.008
rea
Tokenrea
Feature activation+0.000
.
Token.
Feature activation+0.000
6
Token6
Feature activation+0.000
million
Token million
Feature activation+0.284
people
Token people
Feature activation+1.204
have
Token have
Feature activation+0.000
fled
Token fled
Feature activation+3.567
Syria
Token Syria
Feature activation+2.040
during
Token during
Feature activation+0.042
the
Token the
Feature activation+0.000
country
Token country
Feature activation+1.518
âĢ
TokenâĢ
Feature activation+0.000
-
Token-
Feature activation+0.000
saving
Tokensaving
Feature activation+0.154
supplies
Token supplies
Feature activation+0.000
at
Token at
Feature activation+0.122
refugee
Token refugee
Feature activation+0.216
camps
Token camps
Feature activation+3.312
in
Token in
Feature activation+0.553
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000
125
Token 125
Feature activation+0.000
countries
Token countries
Feature activation+2.656
part
Token part
Feature activation+0.000
and
Token and
Feature activation+0.000
parcel
Token parcel
Feature activation+0.024
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
foreign
Token foreign
Feature activation+3.198
policy
Token policy
Feature activation+0.846
that
Token that
Feature activation+0.000
serves
Token serves
Feature activation+0.415
no
Token no
Feature activation+0.000
actual
Token actual
Feature activation+0.000
passing
Token passing
Feature activation+0.000
through
Token through
Feature activation+0.000
Bal
Token Bal
Feature activation+0.000
ata
Tokenata
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.191
.
Token.
Feature activation+0.000
As
Token As
Feature activation+0.000
demonstrators
Token demonstrators
Feature activation+0.000
arrived
Token arrived
Feature activation+2.995
at
Token at
Feature activation+0.264
there
Token there
Feature activation+0.249
are
Token are
Feature activation+0.000
few
Token few
Feature activation+0.000
,
Token,
Feature activation+0.000
can
Token can
Feature activation+0.030
stay
Token stay
Feature activation+3.134
.
Token.
Feature activation+0.000
One
Token One
Feature activation+0.000
cannot
Token cannot
Feature activation+0.155
fors
Token fors
Feature activation+0.041
ake
Tokenake
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
scrambling
Token scrambling
Feature activation+0.000
to
Token to
Feature activation+0.000
res
Token res
Feature activation+3.043
ettle
Tokenettle
Feature activation+2.953
Syrian
Token Syrian
Feature activation+2.654
refugees
Token refugees
Feature activation+0.000
but
Token but
Feature activation+0.000
won
Token won
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.191
.
Token.
Feature activation+0.000
As
Token As
Feature activation+0.000
demonstrators
Token demonstrators
Feature activation+0.000
arrived
Token arrived
Feature activation+2.995
at
Token at
Feature activation+0.264
the
Token the
Feature activation+0.000
junction
Token junction
Feature activation+0.000
next
Token next
Feature activation+0.000
to
Token to
Feature activation+0.015
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
scrambling
Token scrambling
Feature activation+0.000
to
Token to
Feature activation+0.000
res
Token res
Feature activation+3.043
ettle
Tokenettle
Feature activation+2.953
Syrian
Token Syrian
Feature activation+2.654
refugees
Token refugees
Feature activation+0.000
but
Token but
Feature activation+0.000
won
Token won
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.745
commitments
Token commitments
Feature activation+0.402
to
Token to
Feature activation+0.427
res
Token res
Feature activation+3.569
ettle
Tokenettle
Feature activation+2.873
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+2.646
Erit
Token Erit
Feature activation+1.008
rea
Tokenrea
Feature activation+0.000
,
Token,
Feature activation+0.000
displaced
Token displaced
Feature activation+0.000
Iraqis
Token Iraqis
Feature activation+0.000
and
Token and
Feature activation+0.000
Syrian
Token Syrian
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
fled
Token fled
Feature activation+2.865
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
autonomous
Token autonomous
Feature activation+0.000
region
Token region
Feature activation+0.000
.
Token.
Feature activation+0.000
hundreds
Token hundreds
Feature activation+0.000
of
Token of
Feature activation+0.000
thousands
Token thousands
Feature activation+0.000
of
Token of
Feature activation+0.000
people
Token people
Feature activation+1.157
arrive
Token arrive
Feature activation+2.834
each
Token each
Feature activation+0.000
month
Token month
Feature activation+0.006
to
Token to
Feature activation+0.008
escape
Token escape
Feature activation+2.444
the
Token the
Feature activation+0.000
gently
Tokengently
Feature activation+0.000
needed
Token needed
Feature activation+0.069
:
Token:
Feature activation+0.000
reduction
Token reduction
Feature activation+0.451
of
Token of
Feature activation+0.000
influx
Token influx
Feature activation+2.786
,
Token,
Feature activation+0.000
secure
Token secure
Feature activation+1.206
borders
Token borders
Feature activation+2.246
,
Token,
Feature activation+0.000
intens
Token intens
Feature activation+0.000
of
Token of
Feature activation+0.000
an
Token an
Feature activation+0.000
organization
Token organization
Feature activation+0.000
called
Token called
Feature activation+0.000
the
Token the
Feature activation+0.000
Syrian
Token Syrian
Feature activation+2.776
Human
Token Human
Feature activation+0.000
Rights
Token Rights
Feature activation+0.000
League
Token League
Feature activation+0.000
,
Token,
Feature activation+0.000
gained
Token gained
Feature activation+0.000
About
TokenAbout
Feature activation+0.000
20
Token 20
Feature activation+0.000
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
come
Token come
Feature activation+2.755
to
Token to
Feature activation+0.415
Canada
Token Canada
Feature activation+1.236
per
Token per
Feature activation+0.000
year
Token year
Feature activation+0.734
,
Token,
Feature activation+0.000
camps
Token camps
Feature activation+3.312
in
Token in
Feature activation+0.553
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000
125
Token 125
Feature activation+0.000
countries
Token countries
Feature activation+2.656
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
50
Token 50
Feature activation+0.000
largest
Token largest
Feature activation+0.000
camps
Token camps
Feature activation+3.850
s
Tokens
Feature activation+0.000
scrambling
Token scrambling
Feature activation+0.000
to
Token to
Feature activation+0.000
res
Token res
Feature activation+3.043
ettle
Tokenettle
Feature activation+2.953
Syrian
Token Syrian
Feature activation+2.654
refugees
Token refugees
Feature activation+0.000
but
Token but
Feature activation+0.000
won
Token won
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
commitments
Token commitments
Feature activation+0.402
to
Token to
Feature activation+0.427
res
Token res
Feature activation+3.569
ettle
Tokenettle
Feature activation+2.873
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+2.646
Erit
Token Erit
Feature activation+1.008
rea
Tokenrea
Feature activation+0.000
,
Token,
Feature activation+0.000
Sudan
Token Sudan
Feature activation+1.496
,
Token,
Feature activation+0.000

Top DFA by src position
MAX = 5.039

United
Token United
Feature activation-0.006
Top resid features:
Nations
Token Nations
Feature activation+0.010
Top resid features:
High
Token High
Feature activation-0.008
Top resid features:
Commission
Token Commission
Feature activation-0.042
Top resid features:
on
Token on
Feature activation-0.007
Top resid features:
Refugees
Token Refugees
Feature activation+2.200
Top resid features:
offers
Token offers
Feature activation+0.021
Top resid features:
protection
Token protection
Feature activation+0.019
Top resid features:
and
Token and
Feature activation-0.019
Top resid features:
life
Token life
Feature activation+0.004
Top resid features:
-
Token-
Feature activation-0.012
Top resid features:
jan
Tokenjan
Feature activation+0.011
Top resid features:
that
Token that
Feature activation-0.050
Top resid features:
went
Token went
Feature activation-0.049
Top resid features:
to
Token to
Feature activation-0.073
Top resid features:
a
Token a
Feature activation-0.036
Top resid features:
refugee
Token refugee
Feature activation+5.039
Top resid features:
camp
Token camp
Feature activation+0.282
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
Liberia
Token Liberia
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
May
Token May
Feature activation+0.000
Top resid features:
will
Token will
Feature activation-0.005
Top resid features:
be
Token be
Feature activation+0.002
Top resid features:
reversed
Token reversed
Feature activation+0.003
Top resid features:
for
Token for
Feature activation-0.001
Top resid features:
all
Token all
Feature activation+0.007
Top resid features:
refugees
Token refugees
Feature activation+1.908
Top resid features:
.
Token.
Feature activation-0.006
Top resid features:
Ċ
TokenĊ
Feature activation-0.011
Top resid features:
Ċ
TokenĊ
Feature activation-0.011
Top resid features:
READ
TokenREAD
Feature activation-0.003
Top resid features:
MORE
Token MORE
Feature activation-0.003
Top resid features:
United
Token United
Feature activation-0.006
Top resid features:
Nations
Token Nations
Feature activation-0.008
Top resid features:
High
Token High
Feature activation-0.009
Top resid features:
Commission
Token Commission
Feature activation-0.024
Top resid features:
on
Token on
Feature activation-0.005
Top resid features:
Refugees
Token Refugees
Feature activation+2.050
Top resid features:
offers
Token offers
Feature activation+0.014
Top resid features:
protection
Token protection
Feature activation+0.011
Top resid features:
and
Token and
Feature activation-0.004
Top resid features:
life
Token life
Feature activation-0.003
Top resid features:
-
Token-
Feature activation-0.004
Top resid features:
life
Token life
Feature activation+0.009
Top resid features:
-
Token-
Feature activation-0.039
Top resid features:
saving
Tokensaving
Feature activation-0.016
Top resid features:
supplies
Token supplies
Feature activation-0.010
Top resid features:
at
Token at
Feature activation-0.069
Top resid features:
refugee
Token refugee
Feature activation+1.936
Top resid features:
camps
Token camps
Feature activation+0.128
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
more
Token more
Feature activation+0.000
Top resid features:
than
Token than
Feature activation+0.000
Top resid features:
125
Token 125
Feature activation+0.000
Top resid features:
.
Token.
Feature activation-0.008
Top resid features:
â̦
Token â̦
Feature activation-0.012
Top resid features:
These
Token These
Feature activation+0.012
Top resid features:
are
Token are
Feature activation+0.002
Top resid features:
stupid
Token stupid
Feature activation+0.004
Top resid features:
refugee
Token refugee
Feature activation+3.963
Top resid features:
programs
Token programs
Feature activation+0.023
Top resid features:
created
Token created
Feature activation+0.006
Top resid features:
by
Token by
Feature activation-0.012
Top resid features:
stupid
Token stupid
Feature activation-0.009
Top resid features:
politicians
Token politicians
Feature activation+0.063
Top resid features:
,
Token,
Feature activation-0.065
Top resid features:
passing
Token passing
Feature activation+0.033
Top resid features:
through
Token through
Feature activation+0.066
Top resid features:
Bal
Token Bal
Feature activation-0.084
Top resid features:
ata
Tokenata
Feature activation-0.153
Top resid features:
refugee
Token refugee
Feature activation+4.450
Top resid features:
camp
Token camp
Feature activation+0.078
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
As
Token As
Feature activation+0.000
Top resid features:
demonstrators
Token demonstrators
Feature activation+0.000
Top resid features:
arrived
Token arrived
Feature activation+0.000
Top resid features:
Y
Token Y
Feature activation-0.001
Top resid features:
ish
Tokenish
Feature activation+0.013
Top resid features:
ai
Tokenai
Feature activation-0.025
Top resid features:
differentiated
Token differentiated
Feature activation-0.020
Top resid features:
between
Token between
Feature activation-0.021
Top resid features:
refugees
Token refugees
Feature activation+2.120
Top resid features:
and
Token and
Feature activation-0.004
Top resid features:
asylum
Token asylum
Feature activation+1.458
Top resid features:
seekers
Token seekers
Feature activation-0.005
Top resid features:
,
Token,
Feature activation-0.020
Top resid features:
saying
Token saying
Feature activation+0.024
Top resid features:
will
Token will
Feature activation-0.014
Top resid features:
be
Token be
Feature activation-0.007
Top resid features:
reversed
Token reversed
Feature activation+0.013
Top resid features:
for
Token for
Feature activation-0.011
Top resid features:
all
Token all
Feature activation-0.002
Top resid features:
refugees
Token refugees
Feature activation+4.453
Top resid features:
.
Token.
Feature activation-0.022
Top resid features:
Ċ
TokenĊ
Feature activation-0.039
Top resid features:
Ċ
TokenĊ
Feature activation-0.040
Top resid features:
READ
TokenREAD
Feature activation-0.027
Top resid features:
MORE
Token MORE
Feature activation-0.049
Top resid features:
,
Token,
Feature activation-0.025
Top resid features:
passing
Token passing
Feature activation+0.048
Top resid features:
through
Token through
Feature activation+0.093
Top resid features:
Bal
Token Bal
Feature activation-0.049
Top resid features:
ata
Tokenata
Feature activation-0.057
Top resid features:
refugee
Token refugee
Feature activation+4.038
Top resid features:
camp
Token camp
Feature activation+0.162
Top resid features:
.
Token.
Feature activation-0.167
Top resid features:
As
Token As
Feature activation-0.070
Top resid features:
demonstrators
Token demonstrators
Feature activation-0.168
Top resid features:
arrived
Token arrived
Feature activation+0.214
Top resid features:
will
Token will
Feature activation-0.016
Top resid features:
be
Token be
Feature activation-0.012
Top resid features:
reversed
Token reversed
Feature activation+0.011
Top resid features:
for
Token for
Feature activation-0.022
Top resid features:
all
Token all
Feature activation+0.002
Top resid features:
refugees
Token refugees
Feature activation+4.452
Top resid features:
.
Token.
Feature activation-0.018
Top resid features:
Ċ
TokenĊ
Feature activation-0.053
Top resid features:
Ċ
TokenĊ
Feature activation-0.057
Top resid features:
READ
TokenREAD
Feature activation-0.028
Top resid features:
MORE
Token MORE
Feature activation-0.046
Top resid features:
will
Token will
Feature activation-0.005
Top resid features:
be
Token be
Feature activation-0.000
Top resid features:
reversed
Token reversed
Feature activation+0.000
Top resid features:
for
Token for
Feature activation-0.003
Top resid features:
all
Token all
Feature activation+0.007
Top resid features:
refugees
Token refugees
Feature activation+1.910
Top resid features:
.
Token.
Feature activation-0.006
Top resid features:
Ċ
TokenĊ
Feature activation-0.012
Top resid features:
Ċ
TokenĊ
Feature activation-0.012
Top resid features:
READ
TokenREAD
Feature activation-0.003
Top resid features:
MORE
Token MORE
Feature activation-0.003
Top resid features:
million
Token million
Feature activation-0.031
Top resid features:
displaced
Token displaced
Feature activation+0.257
Top resid features:
Iraqis
Token Iraqis
Feature activation-0.047
Top resid features:
and
Token and
Feature activation-0.085
Top resid features:
Syrian
Token Syrian
Feature activation+0.043
Top resid features:
refugees
Token refugees
Feature activation+3.473
Top resid features:
fled
Token fled
Feature activation+0.340
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
autonomous
Token autonomous
Feature activation+0.000
Top resid features:
region
Token region
Feature activation+0.000
Top resid features:
now
Token now
Feature activation-0.015
Top resid features:
worse
Token worse
Feature activation+0.004
Top resid features:
than
Token than
Feature activation-0.005
Top resid features:
those
Token those
Feature activation+0.002
Top resid features:
at
Token at
Feature activation-0.009
Top resid features:
refugee
Token refugee
Feature activation+3.767
Top resid features:
camps
Token camps
Feature activation+0.180
Top resid features:
in
Token in
Feature activation-0.005
Top resid features:
Turkey
Token Turkey
Feature activation-0.020
Top resid features:
,
Token,
Feature activation-0.075
Top resid features:
where
Token where
Feature activation-0.029
Top resid features:
to
Token to
Feature activation-0.007
Top resid features:
reduce
Token reduce
Feature activation-0.021
Top resid features:
its
Token its
Feature activation+0.008
Top resid features:
intake
Token intake
Feature activation-0.035
Top resid features:
of
Token of
Feature activation-0.006
Top resid features:
refugees
Token refugees
Feature activation+3.908
Top resid features:
.
Token.
Feature activation+0.005
Top resid features:
Ċ
TokenĊ
Feature activation-0.019
Top resid features:
Ċ
TokenĊ
Feature activation-0.016
Top resid features:
âĢ
TokenâĢ
Feature activation-0.025
Top resid features:
ľ
Tokenľ
Feature activation+0.000
Top resid features:
-
Token -
Feature activation-0.017
Top resid features:
and
Token and
Feature activation+0.003
Top resid features:
by
Token by
Feature activation-0.002
Top resid features:
extension
Token extension
Feature activation-0.005
Top resid features:
in
Token in
Feature activation+0.003
Top resid features:
refugee
Token refugee
Feature activation+3.615
Top resid features:
homes
Token homes
Feature activation+0.008
Top resid features:
-
Token -
Feature activation-0.016
Top resid features:
since
Token since
Feature activation+0.012
Top resid features:
the
Token the
Feature activation+0.007
Top resid features:
conflict
Token conflict
Feature activation+0.005
Top resid features:
will
Token will
Feature activation+0.009
Top resid features:
be
Token be
Feature activation+0.007
Top resid features:
reversed
Token reversed
Feature activation-0.022
Top resid features:
for
Token for
Feature activation+0.007
Top resid features:
all
Token all
Feature activation+0.008
Top resid features:
refugees
Token refugees
Feature activation+1.846
Top resid features:
.
Token.
Feature activation-0.008
Top resid features:
Ċ
TokenĊ
Feature activation-0.018
Top resid features:
Ċ
TokenĊ
Feature activation-0.019
Top resid features:
READ
TokenREAD
Feature activation+0.002
Top resid features:
MORE
Token MORE
Feature activation-0.000
Top resid features:
life
Token life
Feature activation+0.012
Top resid features:
-
Token-
Feature activation-0.020
Top resid features:
saving
Tokensaving
Feature activation+0.008
Top resid features:
supplies
Token supplies
Feature activation+0.006
Top resid features:
at
Token at
Feature activation-0.016
Top resid features:
refugee
Token refugee
Feature activation+1.231
Top resid features:
camps
Token camps
Feature activation+0.039
Top resid features:
in
Token in
Feature activation-0.005
Top resid features:
more
Token more
Feature activation-0.008
Top resid features:
than
Token than
Feature activation-0.042
Top resid features:
125
Token 125
Feature activation-0.055
Top resid features:
will
Token will
Feature activation-0.001
Top resid features:
be
Token be
Feature activation-0.009
Top resid features:
reversed
Token reversed
Feature activation+0.015
Top resid features:
for
Token for
Feature activation-0.003
Top resid features:
all
Token all
Feature activation+0.001
Top resid features:
refugees
Token refugees
Feature activation+3.976
Top resid features:
.
Token.
Feature activation-0.032
Top resid features:
Ċ
TokenĊ
Feature activation-0.035
Top resid features:
Ċ
TokenĊ
Feature activation-0.034
Top resid features:
READ
TokenREAD
Feature activation-0.021
Top resid features:
MORE
Token MORE
Feature activation-0.039
Top resid features:
year
Tokenyear
Feature activation-0.024
Top resid features:
commitments
Token commitments
Feature activation+0.026
Top resid features:
to
Token to
Feature activation-0.039
Top resid features:
res
Token res
Feature activation+0.023
Top resid features:
ettle
Tokenettle
Feature activation-0.013
Top resid features:
refugees
Token refugees
Feature activation+1.359
Top resid features:
from
Token from
Feature activation+0.132
Top resid features:
Erit
Token Erit
Feature activation+0.000
Top resid features:
rea
Tokenrea
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
Sudan
Token Sudan
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.04

Head 1: 0.03

Head 2: 0.03

Head 3: 0.02

Head 4: 0.04

Head 5: 0.10

Head 6: 0.54

Head 7: 0.06

Head 8: 0.03

Head 9: 0.03

Head 10: 0.03

Head 11: 0.06

Positive logits

Refugees2.74

refugees2.66

refugee2.54

resettlement2.52

Refugee2.43

UNHCR2.39

Shelter2.14

fleeing2.09

aboard2.07

igrants2.06

convoy2.05

seekers2.04

Kurds2.02

Syrians2.00

boarding2.00

idi1.95

migrants1.92

shelters1.92

Kurd1.91

merga1.90

Negative logits

antioxid-2.62

lapt-2.08

cryptoc-2.07

resemb-2.07

similarities-2.03

ATURE-1.99

ELD-1.92

MET-1.90

patents-1.89

antitrust-1.87

exponent-1.83

Trend-1.81

baugh-1.75

patented-1.74

edly-1.73

TABLE-1.71

XT-1.70

virtues-1.69

RAL-1.68

sonian-1.68

INTERVAL 3.465 - 3.850
CONTAINS 0.000%

countries
Token countries
Feature activation+2.656
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
50
Token 50
Feature activation+0.000
largest
Token largest
Feature activation+0.000
camps
Token camps
Feature activation+3.850
,
Token,
Feature activation+0.000
featured
Token featured
Feature activation+0.000
on
Token on
Feature activation+0.228
the
Token the
Feature activation+0.000
above
Token above
Feature activation+0.000
.
Token.
Feature activation+0.000
6
Token6
Feature activation+0.000
million
Token million
Feature activation+0.284
people
Token people
Feature activation+1.204
have
Token have
Feature activation+0.000
fled
Token fled
Feature activation+3.567
Syria
Token Syria
Feature activation+2.040
during
Token during
Feature activation+0.042
the
Token the
Feature activation+0.000
country
Token country
Feature activation+1.518
âĢ
TokenâĢ
Feature activation+0.000
that
Token that
Feature activation+0.000
went
Token went
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.652
in
Token in
Feature activation+0.000
Liberia
Token Liberia
Feature activation+0.602
in
Token in
Feature activation+0.000
May
Token May
Feature activation+0.000
was
Token was
Feature activation+0.000
multi
Token multi
Feature activation+0.357
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.745
commitments
Token commitments
Feature activation+0.402
to
Token to
Feature activation+0.427
res
Token res
Feature activation+3.569
ettle
Tokenettle
Feature activation+2.873
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+2.646
Erit
Token Erit
Feature activation+1.008
rea
Tokenrea
Feature activation+0.000

INTERVAL 3.080 - 3.465
CONTAINS 0.000%

there
Token there
Feature activation+0.249
are
Token are
Feature activation+0.000
few
Token few
Feature activation+0.000
,
Token,
Feature activation+0.000
can
Token can
Feature activation+0.030
stay
Token stay
Feature activation+3.134
.
Token.
Feature activation+0.000
One
Token One
Feature activation+0.000
cannot
Token cannot
Feature activation+0.155
fors
Token fors
Feature activation+0.041
ake
Tokenake
Feature activation+0.000
part
Token part
Feature activation+0.000
and
Token and
Feature activation+0.000
parcel
Token parcel
Feature activation+0.024
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
foreign
Token foreign
Feature activation+3.198
policy
Token policy
Feature activation+0.846
that
Token that
Feature activation+0.000
serves
Token serves
Feature activation+0.415
no
Token no
Feature activation+0.000
actual
Token actual
Feature activation+0.000
passing
Token passing
Feature activation+0.000
through
Token through
Feature activation+0.000
Bal
Token Bal
Feature activation+0.000
ata
Tokenata
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.191
.
Token.
Feature activation+0.000
As
Token As
Feature activation+0.000
demonstrators
Token demonstrators
Feature activation+0.000
arrived
Token arrived
Feature activation+2.995
at
Token at
Feature activation+0.264
-
Token-
Feature activation+0.000
saving
Tokensaving
Feature activation+0.154
supplies
Token supplies
Feature activation+0.000
at
Token at
Feature activation+0.122
refugee
Token refugee
Feature activation+0.216
camps
Token camps
Feature activation+3.312
in
Token in
Feature activation+0.553
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000
125
Token 125
Feature activation+0.000
countries
Token countries
Feature activation+2.656

INTERVAL 2.695 - 3.080
CONTAINS 0.001%

About
TokenAbout
Feature activation+0.000
20
Token 20
Feature activation+0.000
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
come
Token come
Feature activation+2.755
to
Token to
Feature activation+0.415
Canada
Token Canada
Feature activation+1.236
per
Token per
Feature activation+0.000
year
Token year
Feature activation+0.734
,
Token,
Feature activation+0.000
hundreds
Token hundreds
Feature activation+0.000
of
Token of
Feature activation+0.000
thousands
Token thousands
Feature activation+0.000
of
Token of
Feature activation+0.000
people
Token people
Feature activation+1.157
arrive
Token arrive
Feature activation+2.834
each
Token each
Feature activation+0.000
month
Token month
Feature activation+0.006
to
Token to
Feature activation+0.008
escape
Token escape
Feature activation+2.444
the
Token the
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+3.191
.
Token.
Feature activation+0.000
As
Token As
Feature activation+0.000
demonstrators
Token demonstrators
Feature activation+0.000
arrived
Token arrived
Feature activation+2.995
at
Token at
Feature activation+0.264
the
Token the
Feature activation+0.000
junction
Token junction
Feature activation+0.000
next
Token next
Feature activation+0.000
to
Token to
Feature activation+0.015
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
scrambling
Token scrambling
Feature activation+0.000
to
Token to
Feature activation+0.000
res
Token res
Feature activation+3.043
ettle
Tokenettle
Feature activation+2.953
Syrian
Token Syrian
Feature activation+2.654
refugees
Token refugees
Feature activation+0.000
but
Token but
Feature activation+0.000
won
Token won
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.745
commitments
Token commitments
Feature activation+0.402
to
Token to
Feature activation+0.427
res
Token res
Feature activation+3.569
ettle
Tokenettle
Feature activation+2.873
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+2.646
Erit
Token Erit
Feature activation+1.008
rea
Tokenrea
Feature activation+0.000
,
Token,
Feature activation+0.000

INTERVAL 2.310 - 2.695
CONTAINS 0.001%

year
Token year
Feature activation+0.734
,
Token,
Feature activation+0.000
from
Token from
Feature activation+1.507
dozens
Token dozens
Feature activation+0.087
of
Token of
Feature activation+0.002
countries
Token countries
Feature activation+2.611
.
Token.
Feature activation+0.000
Canada
Token Canada
Feature activation+0.732
is
Token is
Feature activation+0.000
in
Token in
Feature activation+0.254
the
Token the
Feature activation+0.019
people
Token people
Feature activation+1.157
arrive
Token arrive
Feature activation+2.834
each
Token each
Feature activation+0.000
month
Token month
Feature activation+0.006
to
Token to
Feature activation+0.008
escape
Token escape
Feature activation+2.444
the
Token the
Feature activation+0.000
bloodshed
Token bloodshed
Feature activation+0.000
in
Token in
Feature activation+0.000
Syria
Token Syria
Feature activation+1.557
.
Token.
Feature activation+0.000
s
Tokens
Feature activation+0.000
scrambling
Token scrambling
Feature activation+0.000
to
Token to
Feature activation+0.000
res
Token res
Feature activation+3.043
ettle
Tokenettle
Feature activation+2.953
Syrian
Token Syrian
Feature activation+2.654
refugees
Token refugees
Feature activation+0.000
but
Token but
Feature activation+0.000
won
Token won
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
One
Token One
Feature activation+0.000
cannot
Token cannot
Feature activation+0.155
fors
Token fors
Feature activation+0.041
ake
Tokenake
Feature activation+0.000
the
Token the
Feature activation+0.109
security
Token security
Feature activation+2.417
of
Token of
Feature activation+0.113
Israelis
Token Israelis
Feature activation+0.314
."
Token."
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
camps
Token camps
Feature activation+3.312
in
Token in
Feature activation+0.553
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000
125
Token 125
Feature activation+0.000
countries
Token countries
Feature activation+2.656
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
50
Token 50
Feature activation+0.000
largest
Token largest
Feature activation+0.000
camps
Token camps
Feature activation+3.850

INTERVAL 1.925 - 2.310
CONTAINS 0.001%

-
Token-
Feature activation+0.000
Assad
TokenAssad
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
initially
Token initially
Feature activation+0.000
fleeing
Token fleeing
Feature activation+1.950
to
Token to
Feature activation+0.000
Egypt
Token Egypt
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
High
Token High
Feature activation+0.000
Commission
Token Commission
Feature activation+0.000
on
Token on
Feature activation+0.000
Refugees
Token Refugees
Feature activation+0.000
offers
Token offers
Feature activation+0.509
protection
Token protection
Feature activation+2.268
and
Token and
Feature activation+0.000
life
Token life
Feature activation+0.684
-
Token-
Feature activation+0.000
saving
Tokensaving
Feature activation+0.154
supplies
Token supplies
Feature activation+0.000
country
Token country
Feature activation+1.817
does
Token does
Feature activation+0.000
not
Token not
Feature activation+0.000
want
Token want
Feature activation+0.318
to
Token to
Feature activation+0.000
accept
Token accept
Feature activation+2.188
more
Token more
Feature activation+0.000
refugees
Token refugees
Feature activation+0.151
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
criticized
Token criticized
Feature activation+0.000
of
Token of
Feature activation+0.000
airstrike
Token airstrike
Feature activation+0.000
at
Token at
Feature activation+0.000
Aleppo
Token Aleppo
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
camp
Token camp
Feature activation+1.952
site
Token site
Feature activation+0.000
-
Token -
Feature activation+0.000
Russian
Token Russian
Feature activation+0.000
MOD
Token MOD
Feature activation+0.000
https
Token https
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.000
In
TokenIn
Feature activation+0.000
Kos
Token Kos
Feature activation+2.249
and
Token and
Feature activation+0.000
Les
Token Les
Feature activation+0.260
bos
Tokenbos
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000

INTERVAL 1.540 - 1.925
CONTAINS 0.002%

ed
Tokened
Feature activation+0.000
/
Token/
Feature activation+0.000
Scan
TokenScan
Feature activation+0.029
p
Tokenp
Feature activation+0.000
ix
Tokenix
Feature activation+0.000
Denmark
Token Denmark
Feature activation+1.644
/
Token/
Feature activation+0.000
Files
TokenFiles
Feature activation+0.000
Tough
Token Tough
Feature activation+0.000
Legislation
Token Legislation
Feature activation+0.000
All
Token All
Feature activation+0.000
EU
Token EU
Feature activation+0.479
last
Token last
Feature activation+0.000
year
Token year
Feature activation+0.005
,
Token,
Feature activation+0.000
many
Token many
Feature activation+0.000
fleeing
Token fleeing
Feature activation+1.557
wars
Token wars
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
Middle
Token Middle
Feature activation+0.000
East
Token East
Feature activation+0.000
all
Token all
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
,
Token,
Feature activation+0.000
except
Token except
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
from
Token from
Feature activation+1.686
Syria
Token Syria
Feature activation+2.097
who
Token who
Feature activation+0.000
are
Token are
Feature activation+0.000
barred
Token barred
Feature activation+0.483
indefinitely
Token indefinitely
Feature activation+0.000
those
Token those
Feature activation+0.000
at
Token at
Feature activation+0.000
refugee
Token refugee
Feature activation+0.151
camps
Token camps
Feature activation+2.370
in
Token in
Feature activation+0.000
Turkey
Token Turkey
Feature activation+1.552
,
Token,
Feature activation+0.000
where
Token where
Feature activation+0.000
hundreds
Token hundreds
Feature activation+0.000
of
Token of
Feature activation+0.000
thousands
Token thousands
Feature activation+0.000
We
TokenWe
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
want
Token want
Feature activation+0.000
to
Token to
Feature activation+0.000
take
Token take
Feature activation+1.733
more
Token more
Feature activation+0.000
refugees
Token refugees
Feature activation+0.000
than
Token than
Feature activation+0.000
we
Token we
Feature activation+0.148
have
Token have
Feature activation+0.000

INTERVAL 1.155 - 1.540
CONTAINS 0.003%

said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Syrian
Token Syrian
Feature activation+1.180
manipulation
Token manipulation
Feature activation+0.000
:
Token:
Feature activation+0.000
Playing
Token Playing
Feature activation+0.000
human
Token human
Feature activation+0.000
misery
Token misery
Feature activation+0.000
tandem
Token tandem
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Syrian
Token Syrian
Feature activation+1.306
Observatory
Token Observatory
Feature activation+0.000
for
Token for
Feature activation+0.000
Human
Token Human
Feature activation+0.000
Rights
Token Rights
Feature activation+0.000
and
Token and
Feature activation+0.000
than
Token than
Feature activation+0.000
1
Token 1
Feature activation+0.000
.
Token.
Feature activation+0.000
6
Token6
Feature activation+0.000
million
Token million
Feature activation+0.284
people
Token people
Feature activation+1.204
have
Token have
Feature activation+0.000
fled
Token fled
Feature activation+3.567
Syria
Token Syria
Feature activation+2.040
during
Token during
Feature activation+0.042
the
Token the
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
criticized
Token criticized
Feature activation+0.000
other
Token other
Feature activation+0.000
western
Token western
Feature activation+0.426
countries
Token countries
Feature activation+1.496
for
Token for
Feature activation+0.000
so
Token so
Feature activation+0.000
far
Token far
Feature activation+0.000
failing
Token failing
Feature activation+0.000
to
Token to
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
West
Token West
Feature activation+0.520
simultaneously
Token simultaneously
Feature activation+0.000
implements
Token implements
Feature activation+0.000
policies
Token policies
Feature activation+1.178
that
Token that
Feature activation+0.000
infl
Token infl
Feature activation+1.300
ame
Tokename
Feature activation+0.000
climate
Token climate
Feature activation+0.558
change
Token change
Feature activation+0.000

INTERVAL 0.770 - 1.155
CONTAINS 0.004%

were
Token were
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
ready
Tokenready
Feature activation+0.000
to
Token to
Feature activation+0.000
travel
Token travel
Feature activation+0.946
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
They
Token They
Feature activation+0.000
had
Token had
Feature activation+0.000
refugee
Token refugee
Feature activation+0.000
homes
Token homes
Feature activation+1.319
-
Token -
Feature activation+0.000
since
Token since
Feature activation+0.000
the
Token the
Feature activation+0.000
conflict
Token conflict
Feature activation+0.859
began
Token began
Feature activation+0.000
five
Token five
Feature activation+0.000
years
Token years
Feature activation+0.000
ago
Token ago
Feature activation+0.000
.
Token.
Feature activation+0.000
and
Token and
Feature activation+0.000
even
Token even
Feature activation+0.000
more
Token more
Feature activation+0.000
desperate
Token desperate
Feature activation+0.338
people
Token people
Feature activation+0.824
taking
Token taking
Feature activation+1.043
even
Token even
Feature activation+0.000
more
Token more
Feature activation+0.000
risks
Token risks
Feature activation+0.000
,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
health
Token health
Feature activation+0.000
:
Token:
Feature activation+0.000
When
Token When
Feature activation+0.000
people
Token people
Feature activation+0.456
delay
Token delay
Feature activation+0.000
seeking
Token seeking
Feature activation+1.028
treatment
Token treatment
Feature activation+0.000
because
Token because
Feature activation+0.000
they
Token they
Feature activation+0.000
can
Token can
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
is
Token is
Feature activation+0.000
bad
Token bad
Feature activation+0.000
for
Token for
Feature activation+0.000
both
Token both
Feature activation+0.000
vulnerable
Token vulnerable
Feature activation+0.000
people
Token people
Feature activation+0.830
and
Token and
Feature activation+0.000
public
Token public
Feature activation+0.000
health
Token health
Feature activation+0.000
:
Token:
Feature activation+0.000
When
Token When
Feature activation+0.000

INTERVAL 0.385 - 0.770
CONTAINS 0.009%

refugees
Token refugees
Feature activation+0.000
come
Token come
Feature activation+2.755
to
Token to
Feature activation+0.415
Canada
Token Canada
Feature activation+1.236
per
Token per
Feature activation+0.000
year
Token year
Feature activation+0.734
,
Token,
Feature activation+0.000
from
Token from
Feature activation+1.507
dozens
Token dozens
Feature activation+0.087
of
Token of
Feature activation+0.002
countries
Token countries
Feature activation+2.611
fringe
Token fringe
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
foreign
Token foreign
Feature activation+2.069
and
Token and
Feature activation+0.000
national
Token national
Feature activation+0.641
security
Token security
Feature activation+0.000
policy
Token policy
Feature activation+0.000
agenda
Token agenda
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
as
Token as
Feature activation+0.000
do
Token do
Feature activation+0.000
most
Token most
Feature activation+0.000
acknowled
Token acknowled
Feature activation+0.064
gments
Tokengments
Feature activation+0.000
from
Token from
Feature activation+0.726
the
Token the
Feature activation+0.000
world
Token world
Feature activation+0.816
of
Token of
Feature activation+0.000
white
Token white
Feature activation+0.606
-
Token-
Feature activation+0.000
and
Token and
Feature activation+0.000
children
Token children
Feature activation+0.000
.
Token.
Feature activation+0.000
For
Token For
Feature activation+0.000
34
Token 34
Feature activation+0.000
million
Token million
Feature activation+0.439
of
Token of
Feature activation+0.000
them
Token them
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
United
Token United
Feature activation+2.313
crisis
Token crisis
Feature activation+1.063
in
Token in
Feature activation+0.140
order
Token order
Feature activation+0.202
to
Token to
Feature activation+0.000
get
Token get
Feature activation+0.226
Sche
Token Sche
Feature activation+0.633
ng
Tokenng
Feature activation+0.000
en
Tokenen
Feature activation+0.000
back
Token back
Feature activation+0.314
up
Token up
Feature activation+0.000
and
Token and
Feature activation+0.000

INTERVAL 0.000 - 0.385
CONTAINS 99.979%

50
Token 50
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Everyone
TokenEveryone
Feature activation+0.000
knows
Token knows
Feature activation+0.000
that
Token that
Feature activation+0.000
needles
Token needles
Feature activation+0.000
shouldn
Token shouldn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
.
Token.
Feature activation+0.000
Although
Token Although
Feature activation+0.000
not
Token not
Feature activation+0.000
specifically
Token specifically
Feature activation+0.000
named
Token named
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
talented
Token talented
Feature activation+0.000
prospect
Token prospect
Feature activation+0.000
,
Token,
Feature activation+0.000
K
Token K
Feature activation+0.000
of
Token of
Feature activation+0.000
toler
Token toler
Feature activation+0.000
ation
Tokenation
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
But
TokenBut
Feature activation+0.000
the
Token the
Feature activation+0.000
mayors
Token mayors
Feature activation+0.000
also
Token also
Feature activation+0.000
called
Token called
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
A
TokenA
Feature activation+0.000
post
Token post
Feature activation+0.000
,
Token,
Feature activation+0.000
said
Token said
Feature activation+0.000
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.000
been
Token been
Feature activation+0.000
written
Token written
Feature activation+0.000
fe
Tokenfe
Feature activation+0.000
els
Tokenels
Feature activation+0.000
good
Token good
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
with
Token with
Feature activation+0.000
new
Token new
Feature activation+0.000
look
Token look
Feature activation+0.000
]
Token]
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 6 in H1.6: (feature 19420

TOP ACTIVATIONS
MAX = 4.006

a
Token a
Feature activation+0.000
little
Token little
Feature activation+0.000
bit
Token bit
Feature activation+0.000
of
Token of
Feature activation+0.000
my
Token my
Feature activation+0.000
women
Token women
Feature activation+4.006
's
Token's
Feature activation+0.000
touch
Token touch
Feature activation+0.249
in
Token in
Feature activation+0.000
this
Token this
Feature activation+0.000
which
Token which
Feature activation+0.000
the
Token the
Feature activation+0.000
odds
Token odds
Feature activation+0.000
of
Token of
Feature activation+0.000
having
Token having
Feature activation+0.000
zero
Token zero
Feature activation+0.526
women
Token women
Feature activation+3.655
speakers
Token speakers
Feature activation+0.000
at
Token at
Feature activation+0.000
a
Token a
Feature activation+0.000
math
Token math
Feature activation+0.000
conference
Token conference
Feature activation+0.000
then
Token then
Feature activation+0.000
we
Token we
Feature activation+0.138
could
Token could
Feature activation+0.000
note
Token note
Feature activation+0.000
that
Token that
Feature activation+0.000
male
Token male
Feature activation+3.614
authors
Token authors
Feature activation+0.000
constituted
Token constituted
Feature activation+0.138
11
Token 11
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
it
Token it
Feature activation+0.000
can
Token can
Feature activation+0.000
offer
Token offer
Feature activation+0.000
a
Token a
Feature activation+0.000
gender
Token gender
Feature activation+1.016
neutral
Token neutral
Feature activation+3.237
designation
Token designation
Feature activation+0.000
on
Token on
Feature activation+0.000
state
Token state
Feature activation+0.000
ID
Token ID
Feature activation+0.000
cards
Token cards
Feature activation+0.374
list
Token list
Feature activation+0.000
.
Token.
Feature activation+0.000
There
Token There
Feature activation+0.000
were
Token were
Feature activation+0.000
7
Token 7
Feature activation+0.000
female
Token female
Feature activation+2.850
authors
Token authors
Feature activation+0.000
listed
Token listed
Feature activation+0.000
and
Token and
Feature activation+0.000
there
Token there
Feature activation+0.000
were
Token were
Feature activation+0.000
election
Token election
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
SPD
Token SPD
Feature activation+0.000
's
Token's
Feature activation+0.000
men
Token men
Feature activation+2.649
may
Token may
Feature activation+0.000
now
Token now
Feature activation+0.000
take
Token take
Feature activation+0.000
these
Token these
Feature activation+0.000
calls
Token calls
Feature activation+0.014
identity
Token identity
Feature activation+2.263
as
Token as
Feature activation+0.723
the
Token the
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
sex
Token sex
Feature activation+2.583
for
Token for
Feature activation+0.000
purposes
Token purposes
Feature activation+0.000
of
Token of
Feature activation+0.000
Title
Token Title
Feature activation+0.000
IX
Token IX
Feature activation+0.000
become
Token become
Feature activation+0.000
irritating
Token irritating
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Women
TokenWomen
Feature activation+2.568
are
Token are
Feature activation+0.000
attracted
Token attracted
Feature activation+0.164
to
Token to
Feature activation+0.000
deeper
Token deeper
Feature activation+0.000
voices
Token voices
Feature activation+0.000
the
Token the
Feature activation+0.000
Minecraft
Token Minecraft
Feature activation+0.000
books
Token books
Feature activation+0.000
into
Token into
Feature activation+0.000
the
Token the
Feature activation+0.000
male
Token male
Feature activation+2.459
author
Token author
Feature activation+0.000
category
Token category
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
makes
Token makes
Feature activation+0.000
the
Token the
Feature activation+0.000
"
Token "
Feature activation+0.000
Min
TokenMin
Feature activation+0.000
istry
Tokenistry
Feature activation+0.000
for
Token for
Feature activation+0.000
Women
Token Women
Feature activation+2.457
and
Token and
Feature activation+0.000
Family
Token Family
Feature activation+0.000
"
Token"
Feature activation+0.000
will
Token will
Feature activation+0.000
be
Token be
Feature activation+0.000
gender
Token gender
Feature activation+0.000
inequality
Token inequality
Feature activation+0.000
and
Token and
Feature activation+0.000
violence
Token violence
Feature activation+0.000
against
Token against
Feature activation+0.000
women
Token women
Feature activation+2.397
,
Token,
Feature activation+0.000
Human
Token Human
Feature activation+0.000
Rights
Token Rights
Feature activation+0.000
Watch
Token Watch
Feature activation+0.000
said
Token said
Feature activation+0.000
Two
Token Two
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
identified
Token identified
Feature activation+1.034
as
Token as
Feature activation+0.412
male
Token male
Feature activation+2.352
,
Token,
Feature activation+0.000
women
Token women
Feature activation+1.713
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
transgender
Token transgender
Feature activation+0.000
UK
Token UK
Feature activation+0.000
,
Token,
Feature activation+0.000
4
Token 4
Feature activation+0.000
,
Token,
Feature activation+0.000
552
Token552
Feature activation+0.000
male
Token male
Feature activation+2.346
suicides
Token suicides
Feature activation+0.000
and
Token and
Feature activation+0.000
1
Token 1
Feature activation+0.000
,
Token,
Feature activation+0.000
493
Token493
Feature activation+0.000
is
Token is
Feature activation+0.000
very
Token very
Feature activation+0.000
different
Token different
Feature activation+0.000
.
Token.
Feature activation+0.000
Sometimes
Token Sometimes
Feature activation+0.000
guys
Token guys
Feature activation+2.292
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
see
Token see
Feature activation+0.000
things
Token things
Feature activation+0.000
we
Token we
Feature activation+0.000
When
TokenWhen
Feature activation+0.000
it
Token it
Feature activation+0.000
comes
Token comes
Feature activation+0.000
to
Token to
Feature activation+0.000
gender
Token gender
Feature activation+0.774
identity
Token identity
Feature activation+2.278
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
attorneys
Token attorneys
Feature activation+0.000
general
Token general
Feature activation+0.057
also
Token also
Feature activation+0.000
other
Token other
Feature activation+0.000
minorities
Token minorities
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
women
Token women
Feature activation+2.267
Michelle
Token Michelle
Feature activation+0.000
Obama
Token Obama
Feature activation+0.000
happened
Token happened
Feature activation+0.000
to
Token to
Feature activation+0.000
miss
Token miss
Feature activation+0.000
reat
Tokenreat
Feature activation+0.000
a
Token a
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
gender
Token gender
Feature activation+0.175
identity
Token identity
Feature activation+2.263
as
Token as
Feature activation+0.723
the
Token the
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
sex
Token sex
Feature activation+2.583
the
Token the
Feature activation+0.000
district
Token district
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
biologically
Token biologically
Feature activation+0.213
female
Token female
Feature activation+2.244
.
Token.
Feature activation+0.000
Female
Token Female
Feature activation+1.884
is
Token is
Feature activation+0.000
her
Token her
Feature activation+0.773
sex
Token sex
Feature activation+1.431
either
Token either
Feature activation+0.000
gender
Token gender
Feature activation+0.000
or
Token or
Feature activation+0.000
were
Token were
Feature activation+0.000
in
Token in
Feature activation+0.000
transition
Token transition
Feature activation+2.235
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
MON
TokenMON
Feature activation+0.000
T
TokenT
Feature activation+0.000
RE
TokenRE
Feature activation+0.000
-
Token-
Feature activation+0.000
needed
Tokenneeded
Feature activation+0.000
explicit
Token explicit
Feature activation+0.000
focus
Token focus
Feature activation+0.000
on
Token on
Feature activation+0.000
women
Token women
Feature activation+2.217
's
Token's
Feature activation+0.000
rights
Token rights
Feature activation+0.000
,
Token,
Feature activation+0.000
Human
Token Human
Feature activation+0.000
Rights
Token Rights
Feature activation+0.000

Top DFA by src position
MAX = 5.889

has
Token has
Feature activation+0.004
Top resid features:
been
Token been
Feature activation+0.002
Top resid features:
focused
Token focused
Feature activation+0.002
Top resid features:
on
Token on
Feature activation+0.002
Top resid features:
her
Token her
Feature activation+0.015
Top resid features:
gender
Token gender
Feature activation+5.511
Top resid features:
,
Token,
Feature activation-0.003
Top resid features:
Ra
Token Ra
Feature activation+0.003
Top resid features:
iche
Tokeniche
Feature activation+0.012
Top resid features:
keeps
Token keeps
Feature activation-0.006
Top resid features:
her
Token her
Feature activation+0.018
Top resid features:
and
Token and
Feature activation+0.002
Top resid features:
defensive
Token defensive
Feature activation+0.007
Top resid features:
people
Token people
Feature activation-0.011
Top resid features:
get
Token get
Feature activation-0.002
Top resid features:
when
Token when
Feature activation-0.003
Top resid features:
gender
Token gender
Feature activation+4.895
Top resid features:
disparity
Token disparity
Feature activation+0.169
Top resid features:
is
Token is
Feature activation+0.006
Top resid features:
pointed
Token pointed
Feature activation-0.007
Top resid features:
out
Token out
Feature activation+0.003
Top resid features:
.
Token.
Feature activation-0.026
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.098
Top resid features:
by
Token by
Feature activation+0.005
Top resid features:
gender
Token gender
Feature activation+5.889
Top resid features:
âĢĶ
Token âĢĶ
Feature activation+0.003
Top resid features:
which
Token which
Feature activation+0.008
Top resid features:
is
Token is
Feature activation+0.046
Top resid features:
subjective
Token subjective
Feature activation+0.012
Top resid features:
and
Token and
Feature activation+0.002
Top resid features:
nation
Token nation
Feature activation+0.017
Top resid features:
to
Token to
Feature activation+0.013
Top resid features:
allow
Token allow
Feature activation-0.000
Top resid features:
a
Token a
Feature activation+0.014
Top resid features:
third
Token third
Feature activation-0.003
Top resid features:
gender
Token gender
Feature activation+2.648
Top resid features:
option
Token option
Feature activation+0.048
Top resid features:
for
Token for
Feature activation+0.013
Top resid features:
driver
Token driver
Feature activation-0.015
Top resid features:
licenses
Token licenses
Feature activation+0.017
Top resid features:
and
Token and
Feature activation+0.013
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.071
Top resid features:
by
Token by
Feature activation+0.006
Top resid features:
gender
Token gender
Feature activation+4.786
Top resid features:
âĢĶ
Token âĢĶ
Feature activation+0.006
Top resid features:
which
Token which
Feature activation+0.008
Top resid features:
is
Token is
Feature activation+0.038
Top resid features:
subjective
Token subjective
Feature activation+0.007
Top resid features:
and
Token and
Feature activation+0.011
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.102
Top resid features:
Gender
Token Gender
Feature activation+1.679
Top resid features:
Qu
Token Qu
Feature activation-0.006
Top resid features:
ota
Tokenota
Feature activation+0.010
Top resid features:
from
Token from
Feature activation+0.012
Top resid features:
Merkel
Token Merkel
Feature activation+0.016
Top resid features:
Female
Token Female
Feature activation+1.031
Top resid features:
t
Tokent
Feature activation-0.005
Top resid features:
reat
Tokenreat
Feature activation+0.018
Top resid features:
a
Token a
Feature activation-0.018
Top resid features:
student
Token student
Feature activation-0.002
Top resid features:
's
Token's
Feature activation+0.015
Top resid features:
gender
Token gender
Feature activation+3.214
Top resid features:
identity
Token identity
Feature activation+0.671
Top resid features:
as
Token as
Feature activation+0.016
Top resid features:
the
Token the
Feature activation-0.035
Top resid features:
student
Token student
Feature activation-0.018
Top resid features:
's
Token's
Feature activation+0.036
Top resid features:
same
Token same
Feature activation+0.043
Top resid features:
trend
Token trend
Feature activation+0.076
Top resid features:
exists
Token exists
Feature activation+0.010
Top resid features:
regardless
Token regardless
Feature activation+0.001
Top resid features:
of
Token of
Feature activation+0.003
Top resid features:
gender
Token gender
Feature activation+2.508
Top resid features:
.
Token.
Feature activation-0.005
Top resid features:
Ċ
TokenĊ
Feature activation+0.002
Top resid features:
Ċ
TokenĊ
Feature activation+0.002
Top resid features:
It
TokenIt
Feature activation+0.009
Top resid features:
âĢ
TokenâĢ
Feature activation-0.006
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.028
Top resid features:
by
Token by
Feature activation-0.003
Top resid features:
gender
Token gender
Feature activation+4.156
Top resid features:
âĢĶ
Token âĢĶ
Feature activation-0.029
Top resid features:
which
Token which
Feature activation+0.002
Top resid features:
is
Token is
Feature activation+0.007
Top resid features:
subjective
Token subjective
Feature activation-0.004
Top resid features:
and
Token and
Feature activation-0.003
Top resid features:
in
Token in
Feature activation+0.010
Top resid features:
its
Token its
Feature activation+0.016
Top resid features:
struggle
Token struggle
Feature activation+0.005
Top resid features:
to
Token to
Feature activation+0.019
Top resid features:
combat
Token combat
Feature activation+0.016
Top resid features:
gender
Token gender
Feature activation+3.309
Top resid features:
inequality
Token inequality
Feature activation+0.212
Top resid features:
and
Token and
Feature activation+0.004
Top resid features:
violence
Token violence
Feature activation+0.026
Top resid features:
against
Token against
Feature activation+0.016
Top resid features:
women
Token women
Feature activation+0.090
Top resid features:
in
Token in
Feature activation-0.001
Top resid features:
its
Token its
Feature activation+0.037
Top resid features:
struggle
Token struggle
Feature activation+0.006
Top resid features:
to
Token to
Feature activation+0.012
Top resid features:
combat
Token combat
Feature activation+0.025
Top resid features:
gender
Token gender
Feature activation+3.179
Top resid features:
inequality
Token inequality
Feature activation+0.333
Top resid features:
and
Token and
Feature activation-0.043
Top resid features:
violence
Token violence
Feature activation+0.034
Top resid features:
against
Token against
Feature activation+0.064
Top resid features:
women
Token women
Feature activation+0.109
Top resid features:
disobedience
Token disobedience
Feature activation-0.016
Top resid features:
.
Token.
Feature activation-0.003
Top resid features:
This
Token This
Feature activation+0.009
Top resid features:
book
Token book
Feature activation+0.000
Top resid features:
features
Token features
Feature activation-0.014
Top resid features:
gender
Token gender
Feature activation+3.383
Top resid features:
discrimination
Token discrimination
Feature activation+0.610
Top resid features:
and
Token and
Feature activation-0.003
Top resid features:
segregation
Token segregation
Feature activation+0.082
Top resid features:
.
Token.
Feature activation-0.063
Top resid features:
Thing
Token Thing
Feature activation+0.021
Top resid features:
M
Token M
Feature activation+0.003
Top resid features:
iser
Tokeniser
Feature activation+0.008
Top resid features:
ably
Tokenably
Feature activation+0.010
Top resid features:
insists
Token insists
Feature activation+0.011
Top resid features:
that
Token that
Feature activation+0.006
Top resid features:
gender
Token gender
Feature activation+3.915
Top resid features:
is
Token is
Feature activation+0.010
Top resid features:
key
Token key
Feature activation-0.002
Top resid features:
to
Token to
Feature activation+0.008
Top resid features:
tackling
Token tackling
Feature activation+0.022
Top resid features:
the
Token the
Feature activation+0.002
Top resid features:
has
Token has
Feature activation+0.006
Top resid features:
been
Token been
Feature activation+0.003
Top resid features:
focused
Token focused
Feature activation+0.001
Top resid features:
on
Token on
Feature activation+0.004
Top resid features:
her
Token her
Feature activation+0.037
Top resid features:
gender
Token gender
Feature activation+3.078
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
Ra
Token Ra
Feature activation+0.011
Top resid features:
iche
Tokeniche
Feature activation+0.007
Top resid features:
keeps
Token keeps
Feature activation-0.005
Top resid features:
her
Token her
Feature activation+0.035
Top resid features:
t
Tokent
Feature activation+0.017
Top resid features:
reat
Tokenreat
Feature activation+0.005
Top resid features:
a
Token a
Feature activation+0.020
Top resid features:
student
Token student
Feature activation-0.008
Top resid features:
's
Token's
Feature activation+0.018
Top resid features:
gender
Token gender
Feature activation+1.892
Top resid features:
identity
Token identity
Feature activation+0.363
Top resid features:
as
Token as
Feature activation+0.012
Top resid features:
the
Token the
Feature activation+0.024
Top resid features:
student
Token student
Feature activation-0.019
Top resid features:
's
Token's
Feature activation+0.015
Top resid features:
obsession
Token obsession
Feature activation-0.002
Top resid features:
with
Token with
Feature activation+0.008
Top resid features:
the
Token the
Feature activation+0.008
Top resid features:
color
Token color
Feature activation+0.011
Top resid features:
and
Token and
Feature activation+0.008
Top resid features:
gender
Token gender
Feature activation+3.220
Top resid features:
of
Token of
Feature activation+0.011
Top resid features:
people
Token people
Feature activation-0.005
Top resid features:
in
Token in
Feature activation+0.007
Top resid features:
the
Token the
Feature activation+0.005
Top resid features:
room
Token room
Feature activation-0.002
Top resid features:
t
Tokent
Feature activation+0.023
Top resid features:
reat
Tokenreat
Feature activation+0.012
Top resid features:
a
Token a
Feature activation-0.073
Top resid features:
student
Token student
Feature activation-0.002
Top resid features:
's
Token's
Feature activation+0.018
Top resid features:
gender
Token gender
Feature activation+3.293
Top resid features:
identity
Token identity
Feature activation+0.593
Top resid features:
as
Token as
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
student
Token student
Feature activation+0.000
Top resid features:
's
Token's
Feature activation+0.000
Top resid features:
limited
Token limited
Feature activation+0.001
Top resid features:
to
Token to
Feature activation+0.003
Top resid features:
the
Token the
Feature activation+0.002
Top resid features:
corresponding
Token corresponding
Feature activation+0.002
Top resid features:
biological
Token biological
Feature activation+0.019
Top resid features:
genders
Token genders
Feature activation+2.301
Top resid features:
,
Token,
Feature activation-0.009
Top resid features:
and
Token and
Feature activation-0.003
Top resid features:
students
Token students
Feature activation-0.011
Top resid features:
with
Token with
Feature activation+0.003
Top resid features:
gender
Token gender
Feature activation+1.130
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.220
Top resid features:
identify
Token identify
Feature activation+0.068
Top resid features:
with
Token with
Feature activation+0.026
Top resid features:
either
Token either
Feature activation+0.007
Top resid features:
gender
Token gender
Feature activation+3.506
Top resid features:
or
Token or
Feature activation+0.127
Top resid features:
were
Token were
Feature activation-0.044
Top resid features:
in
Token in
Feature activation+0.077
Top resid features:
transition
Token transition
Feature activation-0.086
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.007
Top resid features:
its
Token its
Feature activation+0.013
Top resid features:
struggle
Token struggle
Feature activation+0.005
Top resid features:
to
Token to
Feature activation+0.013
Top resid features:
combat
Token combat
Feature activation+0.012
Top resid features:
gender
Token gender
Feature activation+3.086
Top resid features:
inequality
Token inequality
Feature activation+0.292
Top resid features:
and
Token and
Feature activation+0.005
Top resid features:
violence
Token violence
Feature activation+0.024
Top resid features:
against
Token against
Feature activation+0.008
Top resid features:
women
Token women
Feature activation+0.048
Top resid features:

Decoder Weights Distribution

Head 0: 0.04

Head 1: 0.03

Head 2: 0.03

Head 3: 0.02

Head 4: 0.04

Head 5: 0.11

Head 6: 0.53

Head 7: 0.05

Head 8: 0.02

Head 9: 0.03

Head 10: 0.03

Head 11: 0.07

Positive logits

Gender2.59

pronouns2.49

gender2.49

genders2.46

endered2.44

stereotypes2.22

Caucasian2.18

stereotypical2.13

Females2.11

Transgender2.09

hijab2.08

gender2.07

Caucas2.06

Diversity2.06

stereotyp2.04

sexes2.03

Differences2.02

Discrimination2.01

equality2.00

gyn1.97

Negative logits

pled-1.94

ambush-1.88

hope-1.87

prosecuting-1.80

bombardment-1.80

deterrence-1.77

ofi-1.76

propell-1.76

detonated-1.75

gunfire-1.73

deliver-1.72

salv-1.70

delivered-1.70

hooting-1.67

stockp-1.66

delivering-1.66

sprayed-1.66

delivery-1.65

prosecut-1.65

advance-1.64

INTERVAL 3.605 - 4.006
CONTAINS 0.000%

the
Token the
Feature activation+0.000
odds
Token odds
Feature activation+0.000
of
Token of
Feature activation+0.000
having
Token having
Feature activation+0.000
zero
Token zero
Feature activation+0.526
women
Token women
Feature activation+3.655
speakers
Token speakers
Feature activation+0.000
at
Token at
Feature activation+0.000
a
Token a
Feature activation+0.000
math
Token math
Feature activation+0.000
conference
Token conference
Feature activation+0.000
then
Token then
Feature activation+0.000
we
Token we
Feature activation+0.138
could
Token could
Feature activation+0.000
note
Token note
Feature activation+0.000
that
Token that
Feature activation+0.000
male
Token male
Feature activation+3.614
authors
Token authors
Feature activation+0.000
constituted
Token constituted
Feature activation+0.138
11
Token 11
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
a
Token a
Feature activation+0.000
little
Token little
Feature activation+0.000
bit
Token bit
Feature activation+0.000
of
Token of
Feature activation+0.000
my
Token my
Feature activation+0.000
women
Token women
Feature activation+4.006
's
Token's
Feature activation+0.000
touch
Token touch
Feature activation+0.249
in
Token in
Feature activation+0.000
this
Token this
Feature activation+0.000
which
Token which
Feature activation+0.000

INTERVAL 3.205 - 3.605
CONTAINS 0.000%

it
Token it
Feature activation+0.000
can
Token can
Feature activation+0.000
offer
Token offer
Feature activation+0.000
a
Token a
Feature activation+0.000
gender
Token gender
Feature activation+1.016
neutral
Token neutral
Feature activation+3.237
designation
Token designation
Feature activation+0.000
on
Token on
Feature activation+0.000
state
Token state
Feature activation+0.000
ID
Token ID
Feature activation+0.000
cards
Token cards
Feature activation+0.374

INTERVAL 2.804 - 3.205
CONTAINS 0.000%

list
Token list
Feature activation+0.000
.
Token.
Feature activation+0.000
There
Token There
Feature activation+0.000
were
Token were
Feature activation+0.000
7
Token 7
Feature activation+0.000
female
Token female
Feature activation+2.850
authors
Token authors
Feature activation+0.000
listed
Token listed
Feature activation+0.000
and
Token and
Feature activation+0.000
there
Token there
Feature activation+0.000
were
Token were
Feature activation+0.000

INTERVAL 2.404 - 2.804
CONTAINS 0.001%

election
Token election
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
SPD
Token SPD
Feature activation+0.000
's
Token's
Feature activation+0.000
men
Token men
Feature activation+2.649
may
Token may
Feature activation+0.000
now
Token now
Feature activation+0.000
take
Token take
Feature activation+0.000
these
Token these
Feature activation+0.000
calls
Token calls
Feature activation+0.014
identity
Token identity
Feature activation+2.263
as
Token as
Feature activation+0.723
the
Token the
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
sex
Token sex
Feature activation+2.583
for
Token for
Feature activation+0.000
purposes
Token purposes
Feature activation+0.000
of
Token of
Feature activation+0.000
Title
Token Title
Feature activation+0.000
IX
Token IX
Feature activation+0.000
the
Token the
Feature activation+0.000
"
Token "
Feature activation+0.000
Min
TokenMin
Feature activation+0.000
istry
Tokenistry
Feature activation+0.000
for
Token for
Feature activation+0.000
Women
Token Women
Feature activation+2.457
and
Token and
Feature activation+0.000
Family
Token Family
Feature activation+0.000
"
Token"
Feature activation+0.000
will
Token will
Feature activation+0.000
be
Token be
Feature activation+0.000
become
Token become
Feature activation+0.000
irritating
Token irritating
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Women
TokenWomen
Feature activation+2.568
are
Token are
Feature activation+0.000
attracted
Token attracted
Feature activation+0.164
to
Token to
Feature activation+0.000
deeper
Token deeper
Feature activation+0.000
voices
Token voices
Feature activation+0.000
the
Token the
Feature activation+0.000
Minecraft
Token Minecraft
Feature activation+0.000
books
Token books
Feature activation+0.000
into
Token into
Feature activation+0.000
the
Token the
Feature activation+0.000
male
Token male
Feature activation+2.459
author
Token author
Feature activation+0.000
category
Token category
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
makes
Token makes
Feature activation+0.000

INTERVAL 2.003 - 2.404
CONTAINS 0.001%

Gender
Token Gender
Feature activation+0.000
Qu
Token Qu
Feature activation+0.000
ota
Tokenota
Feature activation+0.000
from
Token from
Feature activation+0.000
Merkel
Token Merkel
Feature activation+0.000
Female
Token Female
Feature activation+2.025
members
Token members
Feature activation+0.520
of
Token of
Feature activation+0.000
Germany
Token Germany
Feature activation+0.000
's
Token's
Feature activation+0.000
Social
Token Social
Feature activation+0.000
reat
Tokenreat
Feature activation+0.000
a
Token a
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
gender
Token gender
Feature activation+0.175
identity
Token identity
Feature activation+2.263
as
Token as
Feature activation+0.723
the
Token the
Feature activation+0.000
student
Token student
Feature activation+0.000
's
Token's
Feature activation+0.000
sex
Token sex
Feature activation+2.583
gender
Token gender
Feature activation+0.000
inequality
Token inequality
Feature activation+0.000
and
Token and
Feature activation+0.000
violence
Token violence
Feature activation+0.000
against
Token against
Feature activation+0.000
women
Token women
Feature activation+2.397
,
Token,
Feature activation+0.000
Human
Token Human
Feature activation+0.000
Rights
Token Rights
Feature activation+0.000
Watch
Token Watch
Feature activation+0.000
said
Token said
Feature activation+0.000
Two
Token Two
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
identified
Token identified
Feature activation+1.034
as
Token as
Feature activation+0.412
male
Token male
Feature activation+2.352
,
Token,
Feature activation+0.000
women
Token women
Feature activation+1.713
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
transgender
Token transgender
Feature activation+0.000
manager
Token manager
Feature activation+0.000
Ian
Token Ian
Feature activation+0.000
Doyle
Token Doyle
Feature activation+0.000
said
Token said
Feature activation+0.000
all
Token all
Feature activation+0.000
identifying
Token identifying
Feature activation+2.027
factors
Token factors
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
name
Token name
Feature activation+0.000
,
Token,
Feature activation+0.000
age
Token age
Feature activation+0.000

INTERVAL 1.602 - 2.003
CONTAINS 0.001%

got
Token got
Feature activation+0.000
into
Token into
Feature activation+0.000
some
Token some
Feature activation+0.000
relationships
Token relationships
Feature activation+0.457
with
Token with
Feature activation+0.000
men
Token men
Feature activation+1.779
who
Token who
Feature activation+0.000
only
Token only
Feature activation+0.000
loved
Token loved
Feature activation+0.000
me
Token me
Feature activation+0.000
because
Token because
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.000
The
TokenThe
Feature activation+0.000
men
Token men
Feature activation+1.719
that
Token that
Feature activation+0.000
are
Token are
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
Male
Token Male
Feature activation+0.196
pl
Tokenpl
Feature activation+0.000
ice
Tokenice
Feature activation+0.000
,
Token,
Feature activation+0.000
while
Token while
Feature activation+0.000
the
Token the
Feature activation+0.000
male
Token male
Feature activation+1.998
wears
Token wears
Feature activation+0.000
a
Token a
Feature activation+0.000
black
Token black
Feature activation+0.000
cloth
Token cloth
Feature activation+0.000
over
Token over
Feature activation+0.000
is
Token is
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
wine
Token wine
Feature activation+0.000
is
Token is
Feature activation+0.000
woman
Token woman
Feature activation+1.620
-
Token-
Feature activation+0.000
y
Tokeny
Feature activation+0.000
right
Token right
Feature activation+0.000
?).
Token?).
Feature activation+0.000
So
Token So
Feature activation+0.000
3
Token3
Feature activation+0.000
][
Token][
Feature activation+0.000
6
Token6
Feature activation+0.000
]
Token]
Feature activation+0.000
The
Token The
Feature activation+0.000
female
Token female
Feature activation+1.759
wears
Token wears
Feature activation+0.280
a
Token a
Feature activation+0.000
crown
Token crown
Feature activation+0.000
,
Token,
Feature activation+0.000
a
Token a
Feature activation+0.000

INTERVAL 1.202 - 1.602
CONTAINS 0.002%

of
Token of
Feature activation+0.000
her
Token her
Feature activation+0.199
life
Token life
Feature activation+0.000
as
Token as
Feature activation+0.000
a
Token a
Feature activation+0.000
man
Token man
Feature activation+1.346
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Morgan
TokenMorgan
Feature activation+0.000
,
Token,
Feature activation+0.000
a
Token a
Feature activation+0.000
sofa
Token sofa
Feature activation+0.000
across
Token across
Feature activation+0.000
from
Token from
Feature activation+0.000
a
Token a
Feature activation+0.000
man
Token man
Feature activation+1.300
he
Token he
Feature activation+0.000
believed
Token believed
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
a
Token a
Feature activation+0.000
from
Token from
Feature activation+0.000
resumes
Token resumes
Feature activation+0.000
submitted
Token submitted
Feature activation+0.000
for
Token for
Feature activation+0.000
senior
Token senior
Feature activation+0.000
roles
Token roles
Feature activation+1.281
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
the
Token the
Feature activation+0.000
White
Token White
Feature activation+0.000
House
Token House
Feature activation+0.000
Forum
Token Forum
Feature activation+0.000
on
Token on
Feature activation+0.000
Women
Token Women
Feature activation+1.507
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
Economy
Token Economy
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Moreover
Token Moreover
Feature activation+0.000
,
Token,
Feature activation+0.000
women
Token women
Feature activation+1.600
in
Token in
Feature activation+0.000
male
Token male
Feature activation+1.498
-
Token-
Feature activation+0.000
dominated
Tokendominated
Feature activation+0.052
majors
Token majors
Feature activation+0.000
do
Token do
Feature activation+0.000
not
Token not
Feature activation+0.000

INTERVAL 0.801 - 1.202
CONTAINS 0.004%

every
Token every
Feature activation+0.000
dollar
Token dollar
Feature activation+0.000
made
Token made
Feature activation+0.000
by
Token by
Feature activation+0.000
a
Token a
Feature activation+0.000
man
Token man
Feature activation+1.166
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
industry
Token industry
Feature activation+0.000
.
Token.
Feature activation+0.000
Women
Token Women
Feature activation+0.895
,
Token,
Feature activation+0.000
sub
Token sub
Feature activation+0.000
serv
Tokenserv
Feature activation+0.000
ience
Tokenience
Feature activation+0.000
to
Token to
Feature activation+0.000
men
Token men
Feature activation+1.047
;
Token;
Feature activation+0.000
their
Token their
Feature activation+0.000
flattened
Token flattened
Feature activation+0.000
selves
Token selves
Feature activation+0.000
.
Token.
Feature activation+0.000
likely
Token likely
Feature activation+0.000
to
Token to
Feature activation+0.000
switch
Token switch
Feature activation+0.152
out
Token out
Feature activation+0.000
of
Token of
Feature activation+0.000
male
Token male
Feature activation+1.036
-
Token-
Feature activation+0.000
dominated
Tokendominated
Feature activation+0.271
STEM
Token STEM
Feature activation+0.000
majors
Token majors
Feature activation+0.000
in
Token in
Feature activation+0.000
woman
Token woman
Feature activation+0.774
.
Token.
Feature activation+0.000
And
Token And
Feature activation+0.000
there
Token there
Feature activation+0.000
were
Token were
Feature activation+0.000
guys
Token guys
Feature activation+0.920
who
Token who
Feature activation+0.000
wouldn
Token wouldn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
advantage
Token advantage
Feature activation+0.000
of
Token of
Feature activation+0.000
trees
Token trees
Feature activation+0.000
,
Token,
Feature activation+0.000
especially
Token especially
Feature activation+0.000
female
Token female
Feature activation+0.986
trees
Token trees
Feature activation+0.000
.
Token.
Feature activation+0.000
It
Token It
Feature activation+0.000
also
Token also
Feature activation+0.000
encourages
Token encourages
Feature activation+0.000

INTERVAL 0.401 - 0.801
CONTAINS 0.006%

âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
the
Token the
Feature activation+0.000
only
Token only
Feature activation+0.000
guy
Token guy
Feature activation+0.702
who
Token who
Feature activation+0.000
he
Token he
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
technically
Token technically
Feature activation+0.000
ended
Token ended
Feature activation+0.000
in
Token in
Feature activation+0.000
2009
Token 2009
Feature activation+0.000
,
Token,
Feature activation+0.000
men
Token men
Feature activation+0.441
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
wage
Token wage
Feature activation+0.000
growth
Token growth
Feature activation+0.000
head
Token head
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
woman
Token woman
Feature activation+0.238
and
Token and
Feature activation+0.000
woman
Token woman
Feature activation+0.404
as
Token as
Feature activation+0.000
his
Token his
Feature activation+0.000
help
Token help
Feature activation+0.000
mate
Tokenmate
Feature activation+0.000
â̦
Tokenâ̦
Feature activation+0.000
some
Token some
Feature activation+0.000
trans
Token trans
Feature activation+0.000
,
Token,
Feature activation+0.000
almost
Token almost
Feature activation+0.000
all
Token all
Feature activation+0.000
women
Token women
Feature activation+0.752
of
Token of
Feature activation+0.000
color
Token color
Feature activation+0.000
,
Token,
Feature activation+0.000
never
Token never
Feature activation+0.000
do
Token do
Feature activation+0.002
not
Token not
Feature activation+0.000
get
Token get
Feature activation+0.000
very
Token very
Feature activation+0.000
tall
Token tall
Feature activation+0.000
and
Token and
Feature activation+0.000
sex
Token sex
Feature activation+0.435
very
Token very
Feature activation+0.000
quickly
Token quickly
Feature activation+0.000
...
Token...
Feature activation+0.000
So
Token So
Feature activation+0.000
that
Token that
Feature activation+0.000

INTERVAL 0.000 - 0.401
CONTAINS 99.985%

the
Token the
Feature activation+0.000
state
Token state
Feature activation+0.000
of
Token of
Feature activation+0.000
Colorado
Token Colorado
Feature activation+0.000
made
Token made
Feature activation+0.000
$
Token $
Feature activation+0.000
85
Token85
Feature activation+0.000
million
Token million
Feature activation+0.000
in
Token in
Feature activation+0.000
marijuana
Token marijuana
Feature activation+0.000
tax
Token tax
Feature activation+0.000
is
Token is
Feature activation+0.000
married
Token married
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
governor
Token governor
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
chief
Token chief
Feature activation+0.000
policy
Token policy
Feature activation+0.000
advisor
Token advisor
Feature activation+0.000
heart
Token heart
Feature activation+0.000
on
Token on
Feature activation+0.000
your
Token your
Feature activation+0.000
receipt
Token receipt
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
17
Token17
Feature activation+0.000
.
Token.
Feature activation+0.000
Watch
Token Watch
Feature activation+0.000
re
Token re
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
Eastern
Token Eastern
Feature activation+0.000
Conference
Token Conference
Feature activation+0.000
sem
Token sem
Feature activation+0.000
is
Tokenis
Feature activation+0.000
,
Token,
Feature activation+0.000
odds
Token odds
Feature activation+0.000
are
Token are
Feature activation+0.000
they
Token they
Feature activation+0.000
'd
Token'd
Feature activation+0.000
Post
Token Post
Feature activation+0.000
after
Token after
Feature activation+0.000
the
Token the
Feature activation+0.000
game
Token game
Feature activation+0.000
between
Token between
Feature activation+0.000
the
Token the
Feature activation+0.000
Knicks
Token Knicks
Feature activation+0.000
and
Token and
Feature activation+0.000
Raptors
Token Raptors
Feature activation+0.000
in
Token in
Feature activation+0.000
Toronto
Token Toronto
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 7 in H1.6: (feature 23126

TOP ACTIVATIONS
MAX = 3.418

its
Token its
Feature activation+0.019
beginning
Token beginning
Feature activation+0.214
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Watch
TokenWatch
Feature activation+3.418
MON
Token MON
Feature activation+0.000
ST
TokenST
Feature activation+0.000
ERS
TokenERS
Feature activation+0.000
in
Token in
Feature activation+0.000
its
Token its
Feature activation+0.001
hang
Token hang
Feature activation+0.000
man
Token man
Feature activation+0.000
I
Token I
Feature activation+0.000
would
Token would
Feature activation+0.000
have
Token have
Feature activation+0.000
shown
Token shown
Feature activation+3.299
the
Token the
Feature activation+0.000
pictures
Token pictures
Feature activation+0.917
of
Token of
Feature activation+0.000
all
Token all
Feature activation+0.000
his
Token his
Feature activation+0.000
that
Token that
Feature activation+0.130
it
Token it
Feature activation+0.663
needed
Token needed
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
seen
Token seen
Feature activation+3.199
on
Token on
Feature activation+0.291
the
Token the
Feature activation+0.271
biggest
Token biggest
Feature activation+0.003
screen
Token screen
Feature activation+1.140
possible
Token possible
Feature activation+0.000
movie
Token movie
Feature activation+0.404
again
Token again
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
more
Token more
Feature activation+0.000
watch
Token watch
Feature activation+3.186
able
Tokenable
Feature activation+0.000
form
Token form
Feature activation+0.000
,
Token,
Feature activation+0.000
I
Token I
Feature activation+0.000
'm
Token'm
Feature activation+0.000
'
Token '
Feature activation+0.000
As
TokenAs
Feature activation+0.002
anyone
Token anyone
Feature activation+0.000
who
Token who
Feature activation+0.000
has
Token has
Feature activation+0.000
seen
Token seen
Feature activation+3.179
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.000
can
Token can
Feature activation+0.000
attest
Token attest
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
shot
Token shot
Feature activation+3.133
it
Token it
Feature activation+0.197
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
watched
Token watched
Feature activation+3.169
it
Token it
Feature activation+0.249
in
Token in
Feature activation+0.375
the
Token the
Feature activation+0.187
video
Token video
Feature activation+0.785
store
Token store
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
just
Token just
Feature activation+0.008
thought
Token thought
Feature activation+0.000
we
Token we
Feature activation+0.000
shot
Token shot
Feature activation+3.133
it
Token it
Feature activation+0.197
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
watched
Token watched
Feature activation+3.169
it
Token it
Feature activation+0.249
20
Token 20
Feature activation+0.000
th
Tokenth
Feature activation+0.468
anniversary
Token anniversary
Feature activation+0.000
with
Token with
Feature activation+0.000
a
Token a
Feature activation+0.000
screening
Token screening
Feature activation+3.041
at
Token at
Feature activation+0.224
this
Token this
Feature activation+0.000
year
Token year
Feature activation+0.000
's
Token's
Feature activation+0.118
Tall
Token Tall
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Is
TokenIs
Feature activation+0.000
le
Tokenle
Feature activation+0.000
of
Token of
Feature activation+0.000
Dogs
Token Dogs
Feature activation+0.000
premie
Token premie
Feature activation+2.971
res
Tokenres
Feature activation+0.921
in
Token in
Feature activation+0.000
theat
Token theat
Feature activation+0.769
res
Tokenres
Feature activation+1.007
on
Token on
Feature activation+0.000
you
Token you
Feature activation+0.000
to
Token to
Feature activation+0.000
process
Token process
Feature activation+0.000
them
Token them
Feature activation+0.000
.
Token.
Feature activation+0.000
Director
Token Director
Feature activation+2.971
,
Token,
Feature activation+0.000
Peter
Token Peter
Feature activation+0.000
S
Token S
Feature activation+0.000
ohn
Tokenohn
Feature activation+0.000
and
Token and
Feature activation+0.000
to
Token to
Feature activation+0.000
film
Token film
Feature activation+0.683
festivals
Token festivals
Feature activation+0.000
,
Token,
Feature activation+0.000
people
Token people
Feature activation+0.151
watch
Token watch
Feature activation+2.932
it
Token it
Feature activation+0.169
,
Token,
Feature activation+0.000
they
Token they
Feature activation+0.000
can
Token can
Feature activation+0.000
buy
Token buy
Feature activation+0.000
head
Token head
Feature activation+0.000
out
Token out
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
next
Token next
Feature activation+0.000
premiere
Token premiere
Feature activation+2.884
,
Token,
Feature activation+0.000
film
Token film
Feature activation+0.559
festival
Token festival
Feature activation+0.000
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
film
Token film
Feature activation+1.106
would
Token would
Feature activation+0.000
not
Token not
Feature activation+0.000
be
Token be
Feature activation+0.000
released
Token released
Feature activation+2.769
in
Token in
Feature activation+0.197
Pakistan
Token Pakistan
Feature activation+0.000
.
Token.
Feature activation+0.000
D
Token D
Feature activation+0.000
ang
Tokenang
Feature activation+0.000
the
Token the
Feature activation+0.000
animal
Token animal
Feature activation+0.000
that
Token that
Feature activation+0.000
he
Token he
Feature activation+0.000
and
Token and
Feature activation+0.000
director
Token director
Feature activation+2.721
Gil
Token Gil
Feature activation+0.000
roy
Tokenroy
Feature activation+0.000
say
Token say
Feature activation+0.000
inspired
Token inspired
Feature activation+0.116
Bloom
Token Bloom
Feature activation+0.000
make
Token make
Feature activation+0.000
up
Token up
Feature activation+0.000
and
Token and
Feature activation+0.000
hair
Token hair
Feature activation+0.000
and
Token and
Feature activation+0.000
editing
Token editing
Feature activation+2.688
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
film
Token film
Feature activation+0.000
group
Token group
Feature activation+0.000
's
Token's
Feature activation+0.000
Top
Token Top
Feature activation+0.000
10
Token 10
Feature activation+0.000
titles
Token titles
Feature activation+2.621
include
Token include
Feature activation+0.000
Gy
Token Gy
Feature activation+0.000
ll
Tokenll
Feature activation+0.000
en
Tokenen
Feature activation+0.000
ha
Tokenha
Feature activation+0.000
Lei
Token Lei
Feature activation+0.000
How
Token How
Feature activation+0.000
den
Tokenden
Feature activation+0.000
makes
Token makes
Feature activation+0.433
his
Token his
Feature activation+0.000
director
Token director
Feature activation+2.573
ial
Tokenial
Feature activation+0.000
debut
Token debut
Feature activation+0.331
with
Token with
Feature activation+0.000
Death
Token Death
Feature activation+0.000
g
Tokeng
Feature activation+0.000
reason
Token reason
Feature activation+0.000
to
Token to
Feature activation+0.000
chop
Token chop
Feature activation+0.000
off
Token off
Feature activation+0.000
those
Token those
Feature activation+0.000
scenes
Token scenes
Feature activation+2.537
?
Token?
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
had
Token had
Feature activation+0.000
to
Token to
Feature activation+0.000
go
Token go
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
premiere
Token premiere
Feature activation+2.502
as
Token as
Feature activation+0.000
that
Token that
Feature activation+0.000
was
Token was
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
a
Token a
Feature activation+0.000
special
Token special
Feature activation+0.000
release
Token release
Feature activation+0.791
and
Token and
Feature activation+0.000
public
Token public
Feature activation+0.711
viewing
Token viewing
Feature activation+2.494
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.456
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

Top DFA by src position
MAX = 4.468

feeling
Token feeling
Feature activation-0.021
Top resid features:
more
Token more
Feature activation+0.011
Top resid features:
disturbed
Token disturbed
Feature activation-0.001
Top resid features:
at
Token at
Feature activation-0.012
Top resid features:
the
Token the
Feature activation+0.018
Top resid features:
film
Token film
Feature activation+4.468
Top resid features:
's
Token's
Feature activation-0.021
Top resid features:
conclusion
Token conclusion
Feature activation+0.009
Top resid features:
than
Token than
Feature activation-0.005
Top resid features:
you
Token you
Feature activation-0.015
Top resid features:
did
Token did
Feature activation-0.019
Top resid features:
One
TokenOne
Feature activation-0.005
Top resid features:
of
Token of
Feature activation+0.004
Top resid features:
India
Token India
Feature activation-0.005
Top resid features:
's
Token's
Feature activation-0.004
Top resid features:
prominent
Token prominent
Feature activation+0.036
Top resid features:
film
Token film
Feature activation+4.088
Top resid features:
directors
Token directors
Feature activation+0.171
Top resid features:
wrote
Token wrote
Feature activation-0.089
Top resid features:
:
Token:
Feature activation-0.033
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.007
Top resid features:
Ļ
TokenĻ
Feature activation+0.003
Top resid features:
s
Tokens
Feature activation+0.019
Top resid features:
really
Token really
Feature activation+0.004
Top resid features:
no
Token no
Feature activation+0.014
Top resid features:
other
Token other
Feature activation-0.002
Top resid features:
films
Token films
Feature activation+2.219
Top resid features:
that
Token that
Feature activation+0.009
Top resid features:
this
Token this
Feature activation+0.020
Top resid features:
could
Token could
Feature activation+0.007
Top resid features:
be
Token be
Feature activation+0.002
Top resid features:
compared
Token compared
Feature activation+0.010
Top resid features:
anyone
Token anyone
Feature activation+0.001
Top resid features:
can
Token can
Feature activation-0.006
Top resid features:
post
Token post
Feature activation+0.004
Top resid features:
this
Token this
Feature activation+0.004
Top resid features:
wonderful
Token wonderful
Feature activation-0.039
Top resid features:
movie
Token movie
Feature activation+2.373
Top resid features:
again
Token again
Feature activation-0.036
Top resid features:
in
Token in
Feature activation-0.077
Top resid features:
a
Token a
Feature activation-0.063
Top resid features:
more
Token more
Feature activation-0.011
Top resid features:
watch
Token watch
Feature activation+0.072
Top resid features:
.'
Token.'
Feature activation+0.015
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.004
Top resid features:
But
TokenBut
Feature activation+0.005
Top resid features:
the
Token the
Feature activation+0.026
Top resid features:
film
Token film
Feature activation+3.952
Top resid features:
's
Token's
Feature activation-0.002
Top resid features:
studio
Token studio
Feature activation+0.047
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
Fox
Token Fox
Feature activation+0.024
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
poster
Token poster
Feature activation+0.017
Top resid features:
.
Token.
Feature activation+0.009
Top resid features:
We
Token We
Feature activation+0.003
Top resid features:
shot
Token shot
Feature activation+0.030
Top resid features:
the
Token the
Feature activation+0.015
Top resid features:
movie
Token movie
Feature activation+1.783
Top resid features:
with
Token with
Feature activation+0.006
Top resid features:
a
Token a
Feature activation+0.016
Top resid features:
couple
Token couple
Feature activation+0.001
Top resid features:
of
Token of
Feature activation+0.003
Top resid features:
friends
Token friends
Feature activation-0.002
Top resid features:
're
Token're
Feature activation+0.004
Top resid features:
gonna
Token gonna
Feature activation+0.004
Top resid features:
bring
Token bring
Feature activation+0.004
Top resid features:
this
Token this
Feature activation+0.015
Top resid features:
to
Token to
Feature activation-0.003
Top resid features:
film
Token film
Feature activation+2.168
Top resid features:
festivals
Token festivals
Feature activation+0.024
Top resid features:
,
Token,
Feature activation-0.005
Top resid features:
people
Token people
Feature activation-0.000
Top resid features:
watch
Token watch
Feature activation+0.021
Top resid features:
it
Token it
Feature activation+0.003
Top resid features:
they
Token they
Feature activation+0.007
Top resid features:
got
Token got
Feature activation-0.005
Top resid features:
to
Token to
Feature activation+0.004
Top resid features:
make
Token make
Feature activation+0.004
Top resid features:
a
Token a
Feature activation+0.011
Top resid features:
movie
Token movie
Feature activation+2.271
Top resid features:
in
Token in
Feature activation-0.002
Top resid features:
Wichita
Token Wichita
Feature activation+0.050
Top resid features:
,
Token,
Feature activation-0.008
Top resid features:
Kansas
Token Kansas
Feature activation+0.025
Top resid features:
?"
Token?"
Feature activation+0.015
Top resid features:
Ċ
TokenĊ
Feature activation+0.009
Top resid features:
Ċ
TokenĊ
Feature activation+0.005
Top resid features:
The
TokenThe
Feature activation+0.007
Top resid features:
stop
Token stop
Feature activation-0.003
Top resid features:
motion
Token motion
Feature activation+0.022
Top resid features:
film
Token film
Feature activation+3.916
Top resid features:
will
Token will
Feature activation+0.001
Top resid features:
star
Token star
Feature activation+0.105
Top resid features:
Bill
Token Bill
Feature activation-0.000
Top resid features:
Murray
Token Murray
Feature activation+0.006
Top resid features:
,
Token,
Feature activation+0.003
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.052
Top resid features:
,
Token,
Feature activation+0.008
Top resid features:
this
Token this
Feature activation+0.027
Top resid features:
film
Token film
Feature activation+4.098
Top resid features:
isn
Token isn
Feature activation+0.002
Top resid features:
âĢ
TokenâĢ
Feature activation+0.020
Top resid features:
Ļ
TokenĻ
Feature activation+0.013
Top resid features:
t
Tokent
Feature activation+0.013
Top resid features:
rushed
Token rushed
Feature activation+0.011
Top resid features:
poster
Token poster
Feature activation+0.014
Top resid features:
.
Token.
Feature activation+0.012
Top resid features:
We
Token We
Feature activation+0.004
Top resid features:
shot
Token shot
Feature activation+0.046
Top resid features:
the
Token the
Feature activation+0.026
Top resid features:
movie
Token movie
Feature activation+1.963
Top resid features:
with
Token with
Feature activation+0.010
Top resid features:
a
Token a
Feature activation+0.026
Top resid features:
couple
Token couple
Feature activation+0.005
Top resid features:
of
Token of
Feature activation+0.009
Top resid features:
friends
Token friends
Feature activation-0.001
Top resid features:
of
Token of
Feature activation+0.005
Top resid features:
an
Token an
Feature activation+0.009
Top resid features:
actor
Token actor
Feature activation+0.347
Top resid features:
promoting
Token promoting
Feature activation+0.014
Top resid features:
three
Token three
Feature activation+0.003
Top resid features:
films
Token films
Feature activation+3.689
Top resid features:
simultaneously
Token simultaneously
Feature activation+0.004
Top resid features:
(
Token (
Feature activation-0.005
Top resid features:
Sh
TokenSh
Feature activation-0.000
Top resid features:
ame
Tokename
Feature activation+0.004
Top resid features:
,
Token,
Feature activation-0.003
Top resid features:
reference
Token reference
Feature activation+0.006
Top resid features:
to
Token to
Feature activation+0.011
Top resid features:
Pakistan
Token Pakistan
Feature activation-0.005
Top resid features:
.
Token.
Feature activation+0.012
Top resid features:
The
Token The
Feature activation+0.010
Top resid features:
film
Token film
Feature activation+1.540
Top resid features:
only
Token only
Feature activation+0.005
Top resid features:
highlights
Token highlights
Feature activation+0.022
Top resid features:
India
Token India
Feature activation+0.001
Top resid features:
âĢ
TokenâĢ
Feature activation+0.003
Top resid features:
Ļ
TokenĻ
Feature activation+0.005
Top resid features:
and
Token and
Feature activation+0.032
Top resid features:
30
Token 30
Feature activation-0.017
Top resid features:
pounds
Token pounds
Feature activation+0.024
Top resid features:
for
Token for
Feature activation+0.003
Top resid features:
the
Token the
Feature activation+0.040
Top resid features:
movie
Token movie
Feature activation+3.701
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.003
Top resid features:
another
Tokenanother
Feature activation-0.012
Top resid features:
echo
Token echo
Feature activation-0.004
Top resid features:
of
Token of
Feature activation+0.008
Top resid features:
young
Token young
Feature activation-0.007
Top resid features:
the
Token the
Feature activation+0.016
Top resid features:
69
Token 69
Feature activation-0.003
Top resid features:
th
Tokenth
Feature activation+0.007
Top resid features:
British
Token British
Feature activation+0.006
Top resid features:
Academy
Token Academy
Feature activation-0.001
Top resid features:
Film
Token Film
Feature activation+2.128
Top resid features:
Awards
Token Awards
Feature activation+0.005
Top resid features:
on
Token on
Feature activation+0.004
Top resid features:
Sunday
Token Sunday
Feature activation+0.023
Top resid features:
night
Token night
Feature activation+0.015
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
it
Token it
Feature activation+0.005
Top resid features:
onto
Token onto
Feature activation-0.004
Top resid features:
the
Token the
Feature activation+0.009
Top resid features:
Toronto
Token Toronto
Feature activation+0.004
Top resid features:
International
Token International
Feature activation+0.010
Top resid features:
Film
Token Film
Feature activation+1.236
Top resid features:
Festival
Token Festival
Feature activation+0.033
Top resid features:
's
Token's
Feature activation-0.003
Top resid features:
annual
Token annual
Feature activation+0.014
Top resid features:
list
Token list
Feature activation-0.016
Top resid features:
of
Token of
Feature activation-0.008
Top resid features:
in
Token in
Feature activation+0.008
Top resid features:
the
Token the
Feature activation+0.011
Top resid features:
footsteps
Token footsteps
Feature activation+0.001
Top resid features:
of
Token of
Feature activation+0.007
Top resid features:
such
Token such
Feature activation+0.008
Top resid features:
films
Token films
Feature activation+3.219
Top resid features:
as
Token as
Feature activation+0.004
Top resid features:
Dead
Token Dead
Feature activation-0.002
Top resid features:
Alive
Token Alive
Feature activation-0.069
Top resid features:
,
Token,
Feature activation-0.002
Top resid features:
The
TokenThe
Feature activation+0.018
Top resid features:
a
Token a
Feature activation+0.024
Top resid features:
sports
Token sports
Feature activation+0.012
Top resid features:
-
Token-
Feature activation+0.002
Top resid features:
based
Tokenbased
Feature activation+0.003
Top resid features:
bi
Token bi
Feature activation-0.004
Top resid features:
opic
Tokenopic
Feature activation+1.803
Top resid features:
with
Token with
Feature activation+0.006
Top resid features:
no
Token no
Feature activation+0.010
Top resid features:
direct
Token direct
Feature activation+0.011
Top resid features:
or
Token or
Feature activation+0.004
Top resid features:
indirect
Token indirect
Feature activation+0.005
Top resid features:
like
Token like
Feature activation-0.027
Top resid features:
for
Token for
Feature activation-0.021
Top resid features:
the
Token the
Feature activation+0.003
Top resid features:
Star
Token Star
Feature activation+0.012
Top resid features:
Trek
Token Trek
Feature activation+0.095
Top resid features:
movies
Token movies
Feature activation+3.448
Top resid features:
I
Token I
Feature activation-0.068
Top resid features:
had
Token had
Feature activation-0.074
Top resid features:
to
Token to
Feature activation-0.054
Top resid features:
go
Token go
Feature activation-0.045
Top resid features:
to
Token to
Feature activation-0.053
Top resid features:
a
Token a
Feature activation-0.002
Top resid features:
special
Token special
Feature activation+0.019
Top resid features:
screening
Token screening
Feature activation+0.250
Top resid features:
of
Token of
Feature activation-0.007
Top resid features:
the
Token the
Feature activation+0.003
Top resid features:
film
Token film
Feature activation+1.033
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
and
Token and
Feature activation-0.026
Top resid features:
later
Token later
Feature activation+0.002
Top resid features:
,
Token,
Feature activation-0.027
Top resid features:
the
Token the
Feature activation-0.002
Top resid features:

Decoder Weights Distribution

Head 0: 0.03

Head 1: 0.03

Head 2: 0.03

Head 3: 0.01

Head 4: 0.03

Head 5: 0.10

Head 6: 0.53

Head 7: 0.06

Head 8: 0.02

Head 9: 0.02

Head 10: 0.03

Head 11: 0.09

Positive logits

��2.44

writer2.33

writers2.22

excerpts2.16

films2.13

isodes1.99

convol1.97

netflix1.92

nudity1.90

itone1.90

portray1.90

actresses1.89

Films1.86

ovies1.84

paperback1.81

narration1.81

portraying1.79

1.79

obook1.79

NIGHT1.78

Negative logits

leve-1.75

neighbor-1.75

Wealth-1.72

traded-1.71

laus-1.71

ndra-1.69

routed-1.69

Slack-1.68

neau-1.67

Rails-1.67

GP-1.65

votes-1.64

mone-1.61

osi-1.61

parked-1.57

uninsured-1.57

drilled-1.56

defensive-1.55

Hockey-1.55

nard-1.54

INTERVAL 3.076 - 3.418
CONTAINS 0.001%

.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
just
Token just
Feature activation+0.008
thought
Token thought
Feature activation+0.000
we
Token we
Feature activation+0.000
shot
Token shot
Feature activation+3.133
it
Token it
Feature activation+0.197
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
watched
Token watched
Feature activation+3.169
it
Token it
Feature activation+0.249
'
Token '
Feature activation+0.000
As
TokenAs
Feature activation+0.002
anyone
Token anyone
Feature activation+0.000
who
Token who
Feature activation+0.000
has
Token has
Feature activation+0.000
seen
Token seen
Feature activation+3.179
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.000
can
Token can
Feature activation+0.000
attest
Token attest
Feature activation+0.000
,
Token,
Feature activation+0.000
hang
Token hang
Feature activation+0.000
man
Token man
Feature activation+0.000
I
Token I
Feature activation+0.000
would
Token would
Feature activation+0.000
have
Token have
Feature activation+0.000
shown
Token shown
Feature activation+3.299
the
Token the
Feature activation+0.000
pictures
Token pictures
Feature activation+0.917
of
Token of
Feature activation+0.000
all
Token all
Feature activation+0.000
his
Token his
Feature activation+0.000
we
Token we
Feature activation+0.000
shot
Token shot
Feature activation+3.133
it
Token it
Feature activation+0.197
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
watched
Token watched
Feature activation+3.169
it
Token it
Feature activation+0.249
in
Token in
Feature activation+0.375
the
Token the
Feature activation+0.187
video
Token video
Feature activation+0.785
store
Token store
Feature activation+0.000
that
Token that
Feature activation+0.130
it
Token it
Feature activation+0.663
needed
Token needed
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
seen
Token seen
Feature activation+3.199
on
Token on
Feature activation+0.291
the
Token the
Feature activation+0.271
biggest
Token biggest
Feature activation+0.003
screen
Token screen
Feature activation+1.140
possible
Token possible
Feature activation+0.000

INTERVAL 2.734 - 3.076
CONTAINS 0.001%

Ċ
TokenĊ
Feature activation+0.000
Is
TokenIs
Feature activation+0.000
le
Tokenle
Feature activation+0.000
of
Token of
Feature activation+0.000
Dogs
Token Dogs
Feature activation+0.000
premie
Token premie
Feature activation+2.971
res
Tokenres
Feature activation+0.921
in
Token in
Feature activation+0.000
theat
Token theat
Feature activation+0.769
res
Tokenres
Feature activation+1.007
on
Token on
Feature activation+0.000
20
Token 20
Feature activation+0.000
th
Tokenth
Feature activation+0.468
anniversary
Token anniversary
Feature activation+0.000
with
Token with
Feature activation+0.000
a
Token a
Feature activation+0.000
screening
Token screening
Feature activation+3.041
at
Token at
Feature activation+0.224
this
Token this
Feature activation+0.000
year
Token year
Feature activation+0.000
's
Token's
Feature activation+0.118
Tall
Token Tall
Feature activation+0.000
the
Token the
Feature activation+0.000
film
Token film
Feature activation+1.106
would
Token would
Feature activation+0.000
not
Token not
Feature activation+0.000
be
Token be
Feature activation+0.000
released
Token released
Feature activation+2.769
in
Token in
Feature activation+0.197
Pakistan
Token Pakistan
Feature activation+0.000
.
Token.
Feature activation+0.000
D
Token D
Feature activation+0.000
ang
Tokenang
Feature activation+0.000
to
Token to
Feature activation+0.000
film
Token film
Feature activation+0.683
festivals
Token festivals
Feature activation+0.000
,
Token,
Feature activation+0.000
people
Token people
Feature activation+0.151
watch
Token watch
Feature activation+2.932
it
Token it
Feature activation+0.169
,
Token,
Feature activation+0.000
they
Token they
Feature activation+0.000
can
Token can
Feature activation+0.000
buy
Token buy
Feature activation+0.000
you
Token you
Feature activation+0.000
to
Token to
Feature activation+0.000
process
Token process
Feature activation+0.000
them
Token them
Feature activation+0.000
.
Token.
Feature activation+0.000
Director
Token Director
Feature activation+2.971
,
Token,
Feature activation+0.000
Peter
Token Peter
Feature activation+0.000
S
Token S
Feature activation+0.000
ohn
Tokenohn
Feature activation+0.000
and
Token and
Feature activation+0.000

INTERVAL 2.393 - 2.734
CONTAINS 0.001%

the
Token the
Feature activation+0.000
man
Token man
Feature activation+0.000
walking
Token walking
Feature activation+0.000
across
Token across
Feature activation+0.000
the
Token the
Feature activation+0.000
shot
Token shot
Feature activation+2.434
was
Token was
Feature activation+0.000
shot
Token shot
Feature activation+2.093
dead
Token dead
Feature activation+0.018
for
Token for
Feature activation+0.000
reasons
Token reasons
Feature activation+0.000
the
Token the
Feature activation+0.000
animal
Token animal
Feature activation+0.000
that
Token that
Feature activation+0.000
he
Token he
Feature activation+0.000
and
Token and
Feature activation+0.000
director
Token director
Feature activation+2.721
Gil
Token Gil
Feature activation+0.000
roy
Tokenroy
Feature activation+0.000
say
Token say
Feature activation+0.000
inspired
Token inspired
Feature activation+0.116
Bloom
Token Bloom
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
've
Token've
Feature activation+0.000
wanted
Token wanted
Feature activation+0.000
to
Token to
Feature activation+0.000
see
Token see
Feature activation+2.478
it
Token it
Feature activation+0.171
for
Token for
Feature activation+0.000
a
Token a
Feature activation+0.000
long
Token long
Feature activation+0.000
time
Token time
Feature activation+0.000
Lei
Token Lei
Feature activation+0.000
How
Token How
Feature activation+0.000
den
Tokenden
Feature activation+0.000
makes
Token makes
Feature activation+0.433
his
Token his
Feature activation+0.000
director
Token director
Feature activation+2.573
ial
Tokenial
Feature activation+0.000
debut
Token debut
Feature activation+0.331
with
Token with
Feature activation+0.000
Death
Token Death
Feature activation+0.000
g
Tokeng
Feature activation+0.000
you
Token you
Feature activation+0.000
be
Token be
Feature activation+0.000
lining
Token lining
Feature activation+0.000
up
Token up
Feature activation+0.000
to
Token to
Feature activation+0.000
see
Token see
Feature activation+2.396
it
Token it
Feature activation+0.000
?
Token?
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢĶ
TokenâĢĶ
Feature activation+0.000

INTERVAL 2.051 - 2.393
CONTAINS 0.002%

.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
series
Token series
Feature activation+0.000
will
Token will
Feature activation+0.000
be
Token be
Feature activation+0.000
shot
Token shot
Feature activation+2.207
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
U
Token U
Feature activation+0.000
.
Token.
Feature activation+0.000
K
TokenK
Feature activation+0.000
players
Token players
Feature activation+0.075
.
Token.
Feature activation+0.000
First
Token First
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
director
Token director
Feature activation+2.198
Louis
Token Louis
Feature activation+0.055
Le
Token Le
Feature activation+0.000
Prince
Token Prince
Feature activation+0.000
disappeared
Token disappeared
Feature activation+0.000
from
Token from
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.000
I
Token I
Feature activation+0.000
asked
Token asked
Feature activation+0.000
director
Token director
Feature activation+2.066
Peter
Token Peter
Feature activation+0.000
A
Token A
Feature activation+0.000
ten
Tokenten
Feature activation+0.000
c
Tokenc
Feature activation+0.000
io
Tokenio
Feature activation+0.000
who
Token who
Feature activation+0.000
haven
Token haven
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
seen
Token seen
Feature activation+2.076
it
Token it
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
those
Token those
Feature activation+0.000
who
Token who
Feature activation+0.000
he
Token he
Feature activation+0.000
wanted
Token wanted
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
the
Token the
Feature activation+0.000
title
Token title
Feature activation+2.325
character
Token character
Feature activation+0.543
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
knew
Token knew
Feature activation+0.000
there
Token there
Feature activation+0.000

INTERVAL 1.709 - 2.051
CONTAINS 0.004%

in
Token in
Feature activation+0.000
association
Token association
Feature activation+0.000
with
Token with
Feature activation+0.000
Sp
Token Sp
Feature activation+0.000
indle
Tokenindle
Feature activation+0.000
Productions
Token Productions
Feature activation+1.870
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
full
Token full
Feature activation+0.000
length
Token length
Feature activation+0.000
version
Token version
Feature activation+0.501
It
Token It
Feature activation+0.185
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
worth
Token worth
Feature activation+0.000
watching
Token watching
Feature activation+1.777
7
Token 7
Feature activation+0.000
/
Token/
Feature activation+0.000
10
Token10
Feature activation+0.000
.
Token.
Feature activation+0.000
Actually
Token Actually
Feature activation+0.000
-
Token-
Feature activation+0.000
m
Tokenm
Feature activation+0.000
ovies
Tokenovies
Feature activation+0.423
that
Token that
Feature activation+0.000
have
Token have
Feature activation+0.000
filmed
Token filmed
Feature activation+1.741
in
Token in
Feature activation+0.075
Wichita
Token Wichita
Feature activation+0.000
include
Token include
Feature activation+0.000
Darkness
Token Darkness
Feature activation+0.056
,
Token,
Feature activation+0.000
as
Token as
Feature activation+0.000
noted
Token noted
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
picture
Token picture
Feature activation+2.009
quality
Token quality
Feature activation+2.000
leaves
Token leaves
Feature activation+0.073
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
âĢĶ
Token âĢĶ
Feature activation+0.000
after
Token after
Feature activation+0.000
all
Token all
Feature activation+0.000
,
Token,
Feature activation+0.000
entire
Token entire
Feature activation+0.000
scenes
Token scenes
Feature activation+1.996
are
Token are
Feature activation+0.000
built
Token built
Feature activation+0.000
around
Token around
Feature activation+0.000
George
Token George
Feature activation+0.037
Michael
Token Michael
Feature activation+0.000

INTERVAL 1.367 - 1.709
CONTAINS 0.004%

week
Token week
Feature activation+0.005
early
Token early
Feature activation+0.000
not
Token not
Feature activation+0.000
only
Token only
Feature activation+0.000
to
Token to
Feature activation+0.000
promote
Token promote
Feature activation+1.529
that
Token that
Feature activation+0.130
it
Token it
Feature activation+0.663
needed
Token needed
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
Al
Token Al
Feature activation+0.000
amo
Tokenamo
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
was
Token was
Feature activation+0.000
shot
Token shot
Feature activation+1.493
last
Token last
Feature activation+0.000
summer
Token summer
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.261
after
Token after
Feature activation+0.000
this
Token this
Feature activation+0.000
year
Token year
Feature activation+0.000
's
Token's
Feature activation+0.000
Oscar
Token Oscar
Feature activation+1.178
nominations
Token nominations
Feature activation+1.492
were
Token were
Feature activation+0.000
released
Token released
Feature activation+0.618
and
Token and
Feature activation+0.000
not
Token not
Feature activation+0.000
a
Token a
Feature activation+0.000
available
Token available
Feature activation+0.195
slots
Token slots
Feature activation+0.693
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
four
Token four
Feature activation+0.000
acting
Token acting
Feature activation+1.494
categories
Token categories
Feature activation+0.604
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
This
TokenThis
Feature activation+0.000
made
Token made
Feature activation+0.000
by
Token by
Feature activation+0.000
a
Token a
Feature activation+0.000
known
Token known
Feature activation+0.000
heterosexual
Token heterosexual
Feature activation+0.000
director
Token director
Feature activation+1.371
features
Token features
Feature activation+0.825
the
Token the
Feature activation+0.000
most
Token most
Feature activation+0.000
talented
Token talented
Feature activation+0.613
and
Token and
Feature activation+0.000

INTERVAL 1.025 - 1.367
CONTAINS 0.008%

autobiography
Token autobiography
Feature activation+0.000
or
Token or
Feature activation+0.000
have
Token have
Feature activation+0.000
a
Token a
Feature activation+0.000
film
Token film
Feature activation+0.000
made
Token made
Feature activation+1.189
about
Token about
Feature activation+0.000
his
Token his
Feature activation+0.000
life
Token life
Feature activation+0.000
,
Token,
Feature activation+0.000
Campbell
Token Campbell
Feature activation+0.000
frame
Token frame
Feature activation+1.376
of
Token of
Feature activation+0.000
5
Token 5
Feature activation+0.115
.
Token.
Feature activation+0.000
62
Token62
Feature activation+0.000
seconds
Token seconds
Feature activation+1.174
and
Token and
Feature activation+0.000
there
Token there
Feature activation+0.326
were
Token were
Feature activation+0.000
3
Token 3
Feature activation+0.089
bullets
Token bullets
Feature activation+0.000
fantasy
Token fantasy
Feature activation+0.000
epic
Token epic
Feature activation+0.000
,
Token,
Feature activation+0.000
slated
Token slated
Feature activation+0.582
for
Token for
Feature activation+0.000
release
Token release
Feature activation+1.320
in
Token in
Feature activation+0.000
February
Token February
Feature activation+0.000
,
Token,
Feature activation+0.000
stars
Token stars
Feature activation+0.316
Scots
Token Scots
Feature activation+0.000
in
Token in
Feature activation+0.000
terms
Token terms
Feature activation+0.000
of
Token of
Feature activation+0.000
being
Token being
Feature activation+0.228
a
Token a
Feature activation+0.000
V
Token V
Feature activation+1.047
FX
TokenFX
Feature activation+1.075
-
Token-
Feature activation+0.000
laden
Tokenladen
Feature activation+0.000
action
Token action
Feature activation+0.448
-
Token-
Feature activation+0.000
Peter
Token Peter
Feature activation+0.000
Sell
Token Sell
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
character
Token character
Feature activation+1.099
.
Token.
Feature activation+0.000
He
Token He
Feature activation+0.000
played
Token played
Feature activation+1.154
Eve
Token Eve
Feature activation+0.000
lyn
Tokenlyn
Feature activation+0.000

INTERVAL 0.684 - 1.025
CONTAINS 0.015%

"
Token"
Feature activation+0.000
and
Token and
Feature activation+0.000
"
Token "
Feature activation+0.000
Maria
TokenMaria
Feature activation+0.000
")
Token")
Feature activation+0.000
make
Token make
Feature activation+0.761
all
Token all
Feature activation+0.000
points
Token points
Feature activation+0.000
on
Token on
Feature activation+0.000
his
Token his
Feature activation+0.000
behalf
Token behalf
Feature activation+0.000
...
Token...
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Reviewer
TokenReviewer
Feature activation+0.000
:
Token:
Feature activation+0.000
FP
Token FP
Feature activation+0.973
-
Token -
Feature activation+0.000
favorite
Token favorite
Feature activation+0.000
-
Token -
Feature activation+0.000
December
Token December
Feature activation+0.000
17
Token 17
Feature activation+0.000
it
Token it
Feature activation+0.626
normally
Token normally
Feature activation+0.000
on
Token on
Feature activation+0.217
the
Token the
Feature activation+0.148
25
Token 25
Feature activation+1.247
th
Tokenth
Feature activation+0.728
would
Token would
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.000
lost
Token lost
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
special
Token special
Feature activation+0.000
release
Token release
Feature activation+0.791
and
Token and
Feature activation+0.000
public
Token public
Feature activation+0.711
viewing
Token viewing
Feature activation+2.494
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.456
.
Token.
Feature activation+0.000
career
Token career
Feature activation+0.249
in
Token in
Feature activation+0.000
film
Token film
Feature activation+0.731
as
Token as
Feature activation+0.000
an
Token an
Feature activation+0.000
actor
Token actor
Feature activation+0.754
and
Token and
Feature activation+0.000
screen
Token screen
Feature activation+1.254
writer
Tokenwriter
Feature activation+0.000
and
Token and
Feature activation+0.000
became
Token became
Feature activation+0.000

INTERVAL 0.342 - 0.684
CONTAINS 0.032%

her
Token her
Feature activation+0.000
entire
Token entire
Feature activation+0.000
family
Token family
Feature activation+0.000
that
Token that
Feature activation+0.000
was
Token was
Feature activation+0.000
made
Token made
Feature activation+0.475
in
Token in
Feature activation+0.000
1981
Token 1981
Feature activation+0.000
but
Token but
Feature activation+0.000
not
Token not
Feature activation+0.000
released
Token released
Feature activation+1.343
Ċ
TokenĊ
Feature activation+0.000
Everyone
TokenEveryone
Feature activation+0.000
,
Token,
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
production
Token production
Feature activation+0.608
crew
Token crew
Feature activation+0.533
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
hair
Token hair
Feature activation+0.000
and
Token and
Feature activation+0.000
story
Token story
Feature activation+0.132
foundation
Token foundation
Feature activation+0.000
of
Token of
Feature activation+0.000
that
Tokenthat
Feature activation+0.007
Chapman
Token Chapman
Feature activation+0.000
created
Token created
Feature activation+0.467
was
Token was
Feature activation+0.000
intriguing
Token intriguing
Feature activation+0.051
and
Token and
Feature activation+0.000
it
Token it
Feature activation+0.025
would
Token would
Feature activation+0.000
the
Token the
Feature activation+0.000
end
Token end
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
series
Token series
Feature activation+0.000
premiere
Token premiere
Feature activation+0.494
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
character
Token character
Feature activation+0.018
-
Token-
Feature activation+0.000
inf
Tokeninf
Feature activation+0.000
licted
Tokenlicted
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.433
settles
Token settles
Feature activation+0.000
for
Token for
Feature activation+0.000
surface
Token surface
Feature activation+0.000
impressions
Token impressions
Feature activation+0.054
rather
Token rather
Feature activation+0.000

INTERVAL 0.000 - 0.342
CONTAINS 99.933%

turned
Token turned
Feature activation+0.000
away
Token away
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ad
TokenAd
Feature activation+0.000
voc
Tokenvoc
Feature activation+0.000
ates
Tokenates
Feature activation+0.000
have
Token have
Feature activation+0.000
said
Token said
Feature activation+0.000
this
Token this
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
re
Tokenre
Feature activation+0.000
jealous
Token jealous
Feature activation+0.000
of
Token of
Feature activation+0.000
your
Token your
Feature activation+0.000
sun
Token sun
Feature activation+0.000
and
Token and
Feature activation+0.000
tired
Token tired
Feature activation+0.000
of
Token of
Feature activation+0.000
their
Token their
Feature activation+0.000
shade
Token shade
Feature activation+0.000
Haskell
Token Haskell
Feature activation+0.000
(
Token (
Feature activation+0.000
Was
TokenWas
Feature activation+0.000
ps
Tokenps
Feature activation+0.000
)
Token)
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Jonathan
TokenJonathan
Feature activation+0.000
Joseph
Token Joseph
Feature activation+0.000
(
Token (
Feature activation+0.000
B
TokenB
Feature activation+0.000
pre
Token pre
Feature activation+0.000
processor
Tokenprocessor
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
is
Token is
Feature activation+0.000
in
Token in
Feature activation+0.000
reality
Token reality
Feature activation+0.000
a
Token a
Feature activation+0.000
separate
Token separate
Feature activation+0.000
program
Token program
Feature activation+0.000
(
Token (
Feature activation+0.000
were
Token were
Feature activation+0.000
actually
Token actually
Feature activation+0.000
serious
Token serious
Feature activation+0.000
about
Token about
Feature activation+0.000
this
Token this
Feature activation+0.000
.
Token.
Feature activation+0.000
Essentially
Token Essentially
Feature activation+0.000
,
Token,
Feature activation+0.000
it
Token it
Feature activation+0.000
boiled
Token boiled
Feature activation+0.000
down
Token down
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 8 in H1.6: (feature 20433

TOP ACTIVATIONS
MAX = 3.416

previous
Token previous
Feature activation+0.000
willingness
Token willingness
Feature activation+0.475
of
Token of
Feature activation+0.044
workers
Token workers
Feature activation+0.524
to
Token to
Feature activation+0.064
accept
Token accept
Feature activation+3.416
the
Token the
Feature activation+0.000
minimum
Token minimum
Feature activation+0.399
,
Token,
Feature activation+0.000
no
Token no
Feature activation+0.000
questions
Token questions
Feature activation+0.000
sensible
Token sensible
Feature activation+0.000
positions
Token positions
Feature activation+0.000
on
Token on
Feature activation+0.000
immigration
Token immigration
Feature activation+0.000
,
Token,
Feature activation+0.000
illegal
Token illegal
Feature activation+3.316
guns
Token guns
Feature activation+0.000
,
Token,
Feature activation+0.000
abortion
Token abortion
Feature activation+0.000
rights
Token rights
Feature activation+0.000
and
Token and
Feature activation+0.000
who
Token who
Feature activation+0.304
can
Token can
Feature activation+0.079
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
come
Token come
Feature activation+3.285
up
Token up
Feature activation+0.000
with
Token with
Feature activation+0.180
good
Token good
Feature activation+0.000
ones
Token ones
Feature activation+0.000
,
Token,
Feature activation+0.000
-
Token-
Feature activation+0.000
skill
Tokenskill
Feature activation+0.000
Sloven
Token Sloven
Feature activation+0.391
ians
Tokenians
Feature activation+0.204
to
Token to
Feature activation+0.000
enter
Token enter
Feature activation+2.715
Britain
Token Britain
Feature activation+0.829
than
Token than
Feature activation+0.000
higher
Token higher
Feature activation+0.000
-
Token-
Feature activation+0.000
skill
Tokenskill
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Once
TokenOnce
Feature activation+0.000
immigrants
Token immigrants
Feature activation+0.000
started
Token started
Feature activation+0.000
to
Token to
Feature activation+0.215
arrive
Token arrive
Feature activation+2.451
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
industrial
Token industrial
Feature activation+0.000
cities
Token cities
Feature activation+0.000
they
Token they
Feature activation+0.595
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
in
Token in
Feature activation+0.004
the
Token the
Feature activation+0.000
country
Token country
Feature activation+1.989
illegally
Token illegally
Feature activation+2.333
,
Token,
Feature activation+0.000
will
Token will
Feature activation+0.000
often
Token often
Feature activation+0.000
only
Token only
Feature activation+0.000
be
Token be
Feature activation+0.000
re
Tokenre
Feature activation+0.000
not
Token not
Feature activation+0.000
dissu
Token dissu
Feature activation+0.000
aded
Tokenaded
Feature activation+0.000
from
Token from
Feature activation+0.000
coming
Token coming
Feature activation+2.303
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
fear
Token fear
Feature activation+0.058
of
Token of
Feature activation+0.000
being
Token being
Feature activation+0.000
million
Token million
Feature activation+0.163
immigrants
Token immigrants
Feature activation+0.000
in
Token in
Feature activation+0.080
the
Token the
Feature activation+0.000
country
Token country
Feature activation+1.491
illegally
Token illegally
Feature activation+2.297
who
Token who
Feature activation+0.137
have
Token have
Feature activation+0.126
criminal
Token criminal
Feature activation+0.233
records
Token records
Feature activation+0.000
.
Token.
Feature activation+0.000
inal
Tokeninal
Feature activation+0.000
Tob
Token Tob
Feature activation+0.000
in
Tokenin
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
native
Token native
Feature activation+2.296
of
Token of
Feature activation+0.000
Michigan
Token Michigan
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
oldest
Token oldest
Feature activation+0.000
the
Token the
Feature activation+0.000
nation
Token nation
Feature activation+0.597
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
illegal
Token illegal
Feature activation+2.295
alien
Token alien
Feature activation+0.000
criminal
Token criminal
Feature activation+0.000
offenders
Token offenders
Feature activation+0.000
in
Token in
Feature activation+0.000
our
Token our
Feature activation+0.000
not
Token not
Feature activation+0.177
have
Token have
Feature activation+0.232
the
Token the
Feature activation+0.000
right
Token right
Feature activation+0.818
to
Token to
Feature activation+0.000
stay
Token stay
Feature activation+2.277
in
Token in
Feature activation+0.175
the
Token the
Feature activation+0.000
UK
Token UK
Feature activation+1.514
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
want
Token want
Feature activation+0.000
to
Token to
Feature activation+0.000
throw
Token throw
Feature activation+0.000
every
Token every
Feature activation+0.000
last
Token last
Feature activation+0.000
illegal
Token illegal
Feature activation+2.253
alien
Token alien
Feature activation+0.000
into
Token into
Feature activation+0.223
some
Token some
Feature activation+0.000
cattle
Token cattle
Feature activation+0.000
car
Token car
Feature activation+0.000
country
Token country
Feature activation+1.486
,
Token,
Feature activation+0.000
they
Token they
Feature activation+0.000
're
Token're
Feature activation+0.000
here
Token here
Feature activation+1.407
illegally
Token illegally
Feature activation+2.200
,"
Token,"
Feature activation+0.000
Trump
Token Trump
Feature activation+0.000
said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
de
Token de
Feature activation+0.000
porting
Tokenporting
Feature activation+0.000
some
Token some
Feature activation+0.000
immigrants
Token immigrants
Feature activation+0.000
here
Token here
Feature activation+1.621
illegally
Token illegally
Feature activation+2.150
.
Token.
Feature activation+0.000
Is
Token Is
Feature activation+0.000
that
Token that
Feature activation+0.000
why
Token why
Feature activation+0.000
so
Token so
Feature activation+0.000
a
Token a
Feature activation+0.000
U
Token U
Feature activation+1.516
.
Token.
Feature activation+0.099
S
TokenS
Feature activation+1.050
.
Token.
Feature activation+0.000
Border
Token Border
Feature activation+2.130
Patrol
Token Patrol
Feature activation+0.033
facility
Token facility
Feature activation+0.358
in
Token in
Feature activation+0.000
Tucson
Token Tucson
Feature activation+0.000
,
Token,
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
towel
Token towel
Feature activation+0.000
on
Token on
Feature activation+0.000
policing
Token policing
Feature activation+0.000
illegal
Token illegal
Feature activation+2.122
hotels
Token hotels
Feature activation+0.000
catering
Token catering
Feature activation+0.000
mostly
Token mostly
Feature activation+0.000
to
Token to
Feature activation+0.000
offshore
Token offshore
Feature activation+0.301
.
Token.
Feature activation+0.000
Business
Token Business
Feature activation+0.000
es
Tokenes
Feature activation+0.230
that
Token that
Feature activation+0.000
use
Token use
Feature activation+0.196
illegal
Token illegal
Feature activation+2.113
labour
Token labour
Feature activation+0.000
will
Token will
Feature activation+0.000
face
Token face
Feature activation+0.462
increased
Token increased
Feature activation+0.002
fines
Token fines
Feature activation+0.000
decide
Token decide
Feature activation+0.000
that
Token that
Feature activation+0.000
the
Token the
Feature activation+0.000
time
Token time
Feature activation+0.000
had
Token had
Feature activation+0.000
come
Token come
Feature activation+2.104
to
Token to
Feature activation+0.000
give
Token give
Feature activation+0.000
workers
Token workers
Feature activation+0.000
a
Token a
Feature activation+0.000
good
Token good
Feature activation+0.000
limit
Token limit
Feature activation+0.084
legal
Token legal
Feature activation+1.970
as
Token as
Feature activation+0.000
well
Token well
Feature activation+0.000
as
Token as
Feature activation+0.000
illegal
Token illegal
Feature activation+2.089
immigration
Token immigration
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Some
TokenSome
Feature activation+0.000
European
Token European
Feature activation+0.178
immigrants
Token immigrants
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
gradually
Token gradually
Feature activation+0.000
settled
Token settled
Feature activation+2.082
the
Token the
Feature activation+0.000
province
Token province
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
development
Token development
Feature activation+0.053

Top DFA by src position
MAX = 4.740

âĢ
TokenâĢ
Feature activation-0.001
Top resid features:
Ļ
TokenĻ
Feature activation-0.001
Top resid features:
t
Tokent
Feature activation+0.014
Top resid features:
have
Token have
Feature activation+0.007
Top resid features:
good
Token good
Feature activation+0.005
Top resid features:
immigration
Token immigration
Feature activation+2.223
Top resid features:
papers
Token papers
Feature activation+0.029
Top resid features:
.
Token.
Feature activation-0.009
Top resid features:
By
Token By
Feature activation+0.000
Top resid features:
asking
Token asking
Feature activation+0.019
Top resid features:
for
Token for
Feature activation+0.004
Top resid features:
also
Token also
Feature activation+0.018
Top resid features:
taken
Token taken
Feature activation+0.066
Top resid features:
sensible
Token sensible
Feature activation+0.078
Top resid features:
positions
Token positions
Feature activation+0.040
Top resid features:
on
Token on
Feature activation+0.013
Top resid features:
immigration
Token immigration
Feature activation+4.740
Top resid features:
,
Token,
Feature activation+0.061
Top resid features:
illegal
Token illegal
Feature activation-0.136
Top resid features:
guns
Token guns
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
abortion
Token abortion
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.002
Top resid features:
Ŀ
TokenĿ
Feature activation-0.010
Top resid features:
their
Token their
Feature activation+0.043
Top resid features:
workers
Token workers
Feature activation+0.030
Top resid features:
are
Token are
Feature activation+0.028
Top resid features:
immigrants
Token immigrants
Feature activation+2.486
Top resid features:
,
Token,
Feature activation-0.010
Top resid features:
and
Token and
Feature activation-0.014
Top resid features:
maybe
Token maybe
Feature activation+0.023
Top resid features:
some
Token some
Feature activation+0.035
Top resid features:
don
Token don
Feature activation+0.030
Top resid features:
a
Token a
Feature activation+0.005
Top resid features:
magnet
Token magnet
Feature activation-0.008
Top resid features:
for
Token for
Feature activation+0.005
Top resid features:
further
Token further
Feature activation-0.003
Top resid features:
EU
Token EU
Feature activation+0.017
Top resid features:
immigration
Token immigration
Feature activation+3.208
Top resid features:
.
Token.
Feature activation-0.010
Top resid features:
And
Token And
Feature activation+0.009
Top resid features:
no
Token no
Feature activation+0.012
Top resid features:
-
Token-
Feature activation-0.004
Top resid features:
one
Tokenone
Feature activation+0.013
Top resid features:
Janeiro
Token Janeiro
Feature activation-0.045
Top resid features:
.
Token.
Feature activation+0.010
Top resid features:
Ċ
TokenĊ
Feature activation+0.034
Top resid features:
Ċ
TokenĊ
Feature activation+0.045
Top resid features:
Once
TokenOnce
Feature activation+0.044
Top resid features:
immigrants
Token immigrants
Feature activation+3.664
Top resid features:
started
Token started
Feature activation+0.130
Top resid features:
to
Token to
Feature activation+0.143
Top resid features:
arrive
Token arrive
Feature activation+0.059
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
,
Token,
Feature activation-0.007
Top resid features:
practice
Token practice
Feature activation+0.015
Top resid features:
to
Token to
Feature activation+0.002
Top resid features:
protect
Token protect
Feature activation+0.008
Top resid features:
illegal
Token illegal
Feature activation-0.017
Top resid features:
immigrants
Token immigrants
Feature activation+2.429
Top resid features:
convicted
Token convicted
Feature activation+0.019
Top resid features:
of
Token of
Feature activation+0.007
Top resid features:
crimes
Token crimes
Feature activation-0.020
Top resid features:
from
Token from
Feature activation+0.018
Top resid features:
being
Token being
Feature activation+0.009
Top resid features:
the
Token the
Feature activation+0.086
Top resid features:
U
Token U
Feature activation-0.017
Top resid features:
.
Token.
Feature activation+0.011
Top resid features:
S
TokenS
Feature activation+0.006
Top resid features:
.
Token.
Feature activation+0.014
Top resid features:
immigration
Token immigration
Feature activation+3.260
Top resid features:
system
Token system
Feature activation+0.048
Top resid features:
.
Token.
Feature activation+0.020
Top resid features:
Ċ
TokenĊ
Feature activation+0.021
Top resid features:
Ċ
TokenĊ
Feature activation+0.028
Top resid features:
âĢ
TokenâĢ
Feature activation+0.044
Top resid features:
the
Token the
Feature activation+0.015
Top resid features:
countries
Token countries
Feature activation+0.005
Top resid features:
where
Token where
Feature activation+0.004
Top resid features:
the
Token the
Feature activation+0.013
Top resid features:
illegal
Token illegal
Feature activation+0.000
Top resid features:
immigrants
Token immigrants
Feature activation+1.912
Top resid features:
are
Token are
Feature activation+0.014
Top resid features:
from
Token from
Feature activation+0.011
Top resid features:
,
Token,
Feature activation-0.005
Top resid features:
are
Token are
Feature activation+0.013
Top resid features:
for
Token for
Feature activation+0.006
Top resid features:
are
Token are
Feature activation+0.036
Top resid features:
a
Token a
Feature activation+0.025
Top resid features:
beacon
Token beacon
Feature activation+0.014
Top resid features:
for
Token for
Feature activation+0.001
Top resid features:
all
Token all
Feature activation+0.036
Top resid features:
immigrants
Token immigrants
Feature activation+3.697
Top resid features:
.
Token.
Feature activation-0.037
Top resid features:
Ċ
TokenĊ
Feature activation-0.004
Top resid features:
Ċ
TokenĊ
Feature activation-0.004
Top resid features:
Card
TokenCard
Feature activation+0.087
Top resid features:
inal
Tokeninal
Feature activation-0.053
Top resid features:
have
Token have
Feature activation+0.005
Top resid features:
done
Token done
Feature activation+0.012
Top resid features:
to
Token to
Feature activation+0.007
Top resid features:
fight
Token fight
Feature activation+0.040
Top resid features:
illegal
Token illegal
Feature activation-0.031
Top resid features:
immigration
Token immigration
Feature activation+3.454
Top resid features:
,
Token,
Feature activation+0.001
Top resid features:
we
Token we
Feature activation+0.020
Top resid features:
have
Token have
Feature activation+0.009
Top resid features:
been
Token been
Feature activation+0.009
Top resid features:
responsible
Token responsible
Feature activation+0.037
Top resid features:
it
Token it
Feature activation+0.046
Top resid features:
.
Token.
Feature activation+0.005
Top resid features:
Ċ
TokenĊ
Feature activation+0.003
Top resid features:
Ċ
TokenĊ
Feature activation+0.004
Top resid features:
Imm
TokenImm
Feature activation+0.201
Top resid features:
igration
Tokenigration
Feature activation+2.030
Top resid features:
Bill
Token Bill
Feature activation+0.046
Top resid features:
Ċ
TokenĊ
Feature activation+0.004
Top resid features:
Ċ
TokenĊ
Feature activation+0.004
Top resid features:
This
TokenThis
Feature activation+0.047
Top resid features:
bill
Token bill
Feature activation+0.040
Top resid features:
.
Token.
Feature activation-0.002
Top resid features:
Ted
Token Ted
Feature activation+0.008
Top resid features:
Kennedy
Token Kennedy
Feature activation+0.012
Top resid features:
's
Token's
Feature activation+0.008
Top resid features:
1965
Token 1965
Feature activation-0.001
Top resid features:
immigration
Token immigration
Feature activation+3.317
Top resid features:
reform
Token reform
Feature activation-0.009
Top resid features:
,
Token,
Feature activation-0.001
Top resid features:
which
Token which
Feature activation+0.009
Top resid features:
opened
Token opened
Feature activation+0.002
Top resid features:
the
Token the
Feature activation+0.009
Top resid features:
the
Token the
Feature activation+0.007
Top resid features:
countries
Token countries
Feature activation+0.006
Top resid features:
where
Token where
Feature activation+0.002
Top resid features:
the
Token the
Feature activation+0.006
Top resid features:
illegal
Token illegal
Feature activation+0.008
Top resid features:
immigrants
Token immigrants
Feature activation+1.357
Top resid features:
are
Token are
Feature activation+0.005
Top resid features:
from
Token from
Feature activation+0.004
Top resid features:
,
Token,
Feature activation-0.003
Top resid features:
are
Token are
Feature activation+0.006
Top resid features:
for
Token for
Feature activation+0.002
Top resid features:
odds
Token odds
Feature activation+0.027
Top resid features:
with
Token with
Feature activation+0.003
Top resid features:
the
Token the
Feature activation+0.001
Top resid features:
president
Token president
Feature activation-0.006
Top resid features:
's
Token's
Feature activation+0.007
Top resid features:
immigration
Token immigration
Feature activation+2.040
Top resid features:
policies
Token policies
Feature activation-0.011
Top resid features:
--
Token --
Feature activation+0.001
Top resid features:
whether
Token whether
Feature activation+0.017
Top resid features:
it
Token it
Feature activation+0.013
Top resid features:
's
Token's
Feature activation+0.013
Top resid features:
have
Token have
Feature activation+0.013
Top resid features:
."
Token."
Feature activation-0.009
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation-0.066
Top resid features:
Il
TokenIl
Feature activation+0.061
Top resid features:
legal
Tokenlegal
Feature activation+0.045
Top resid features:
immigrants
Token immigrants
Feature activation+3.623
Top resid features:
file
Token file
Feature activation+0.043
Top resid features:
into
Token into
Feature activation+0.040
Top resid features:
a
Token a
Feature activation+0.011
Top resid features:
U
Token U
Feature activation-0.001
Top resid features:
.
Token.
Feature activation+0.003
Top resid features:
lowest
Token lowest
Feature activation+0.017
Top resid features:
declared
Token declared
Feature activation+0.005
Top resid features:
incomes
Token incomes
Feature activation+0.026
Top resid features:
of
Token of
Feature activation+0.003
Top resid features:
any
Token any
Feature activation+0.020
Top resid features:
immigration
Token immigration
Feature activation+2.709
Top resid features:
stream
Token stream
Feature activation-0.002
Top resid features:
,
Token,
Feature activation-0.001
Top resid features:
âĢ
Token âĢ
Feature activation+0.010
Top resid features:
ľ
Tokenľ
Feature activation-0.007
Top resid features:
lower
Tokenlower
Feature activation+0.017
Top resid features:
it
Token it
Feature activation+0.015
Top resid features:
.
Token.
Feature activation+0.006
Top resid features:
Ċ
TokenĊ
Feature activation-0.000
Top resid features:
Ċ
TokenĊ
Feature activation-0.000
Top resid features:
Imm
TokenImm
Feature activation+0.154
Top resid features:
igration
Tokenigration
Feature activation+1.534
Top resid features:
Bill
Token Bill
Feature activation+0.027
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.003
Top resid features:
This
TokenThis
Feature activation+0.014
Top resid features:
bill
Token bill
Feature activation+0.004
Top resid features:
âĢ
TokenâĢ
Feature activation+0.001
Top resid features:
Ŀ
TokenĿ
Feature activation-0.002
Top resid features:
their
Token their
Feature activation+0.020
Top resid features:
workers
Token workers
Feature activation+0.014
Top resid features:
are
Token are
Feature activation+0.011
Top resid features:
immigrants
Token immigrants
Feature activation+1.945
Top resid features:
,
Token,
Feature activation-0.001
Top resid features:
and
Token and
Feature activation+0.002
Top resid features:
maybe
Token maybe
Feature activation+0.010
Top resid features:
some
Token some
Feature activation+0.020
Top resid features:
don
Token don
Feature activation+0.013
Top resid features:
hard
Token hard
Feature activation+0.023
Top resid features:
-
Token-
Feature activation-0.009
Top resid features:
line
Tokenline
Feature activation+0.024
Top resid features:
stance
Token stance
Feature activation+0.030
Top resid features:
on
Token on
Feature activation+0.011
Top resid features:
immigration
Token immigration
Feature activation+1.916
Top resid features:
.
Token.
Feature activation-0.039
Top resid features:
Supporters
Token Supporters
Feature activation+0.036
Top resid features:
say
Token say
Feature activation+0.033
Top resid features:
Kob
Token Kob
Feature activation-0.047
Top resid features:
ach
Tokenach
Feature activation-0.002
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
18
Token 18
Feature activation+0.031
Top resid features:
78
Token78
Feature activation+0.013
Top resid features:
by
Token by
Feature activation-0.003
Top resid features:
Italian
Token Italian
Feature activation+0.054
Top resid features:
immigrants
Token immigrants
Feature activation+2.004
Top resid features:
who
Token who
Feature activation+0.022
Top resid features:
were
Token were
Feature activation+0.020
Top resid features:
soon
Token soon
Feature activation+0.010
Top resid features:
followed
Token followed
Feature activation+0.002
Top resid features:
by
Token by
Feature activation-0.014
Top resid features:

Decoder Weights Distribution

Head 0: 0.03

Head 1: 0.04

Head 2: 0.03

Head 3: 0.02

Head 4: 0.03

Head 5: 0.11

Head 6: 0.53

Head 7: 0.07

Head 8: 0.03

Head 9: 0.03

Head 10: 0.02

Head 11: 0.06

Positive logits

DACA2.21

Refugee2.20

refugee2.16

Refugees2.16

deport2.09

immigrant2.09

refugees2.04

born2.04

visa2.01

arrivals2.00

deported2.00

visas2.00

abouts1.97

immigrant1.92

ocumented1.91

boarding1.87

igrants1.87

Afghans1.86

Detention1.84

Census1.83

Negative logits

HY-2.14

insula-2.01

RAL-1.99

quickShipAvailable-1.97

velength-1.94

sonian-1.89

-1.87

magnification-1.86

IPM-1.85

outube-1.84

iron-1.84

roth-1.80

ELD-1.80

exerted-1.77

Zen-1.76

MET-1.75

Nich-1.71

elight-1.70

ury-1.70

antioxid-1.70

INTERVAL 3.074 - 3.416
CONTAINS 0.000%

sensible
Token sensible
Feature activation+0.000
positions
Token positions
Feature activation+0.000
on
Token on
Feature activation+0.000
immigration
Token immigration
Feature activation+0.000
,
Token,
Feature activation+0.000
illegal
Token illegal
Feature activation+3.316
guns
Token guns
Feature activation+0.000
,
Token,
Feature activation+0.000
abortion
Token abortion
Feature activation+0.000
rights
Token rights
Feature activation+0.000
and
Token and
Feature activation+0.000
who
Token who
Feature activation+0.304
can
Token can
Feature activation+0.079
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
come
Token come
Feature activation+3.285
up
Token up
Feature activation+0.000
with
Token with
Feature activation+0.180
good
Token good
Feature activation+0.000
ones
Token ones
Feature activation+0.000
,
Token,
Feature activation+0.000
previous
Token previous
Feature activation+0.000
willingness
Token willingness
Feature activation+0.475
of
Token of
Feature activation+0.044
workers
Token workers
Feature activation+0.524
to
Token to
Feature activation+0.064
accept
Token accept
Feature activation+3.416
the
Token the
Feature activation+0.000
minimum
Token minimum
Feature activation+0.399
,
Token,
Feature activation+0.000
no
Token no
Feature activation+0.000
questions
Token questions
Feature activation+0.000

INTERVAL 2.733 - 3.074
CONTAINS 0.000%

INTERVAL 2.391 - 2.733
CONTAINS 0.000%

-
Token-
Feature activation+0.000
skill
Tokenskill
Feature activation+0.000
Sloven
Token Sloven
Feature activation+0.391
ians
Tokenians
Feature activation+0.204
to
Token to
Feature activation+0.000
enter
Token enter
Feature activation+2.715
Britain
Token Britain
Feature activation+0.829
than
Token than
Feature activation+0.000
higher
Token higher
Feature activation+0.000
-
Token-
Feature activation+0.000
skill
Tokenskill
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Once
TokenOnce
Feature activation+0.000
immigrants
Token immigrants
Feature activation+0.000
started
Token started
Feature activation+0.000
to
Token to
Feature activation+0.215
arrive
Token arrive
Feature activation+2.451
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
industrial
Token industrial
Feature activation+0.000
cities
Token cities
Feature activation+0.000
they
Token they
Feature activation+0.595

INTERVAL 2.049 - 2.391
CONTAINS 0.002%

re
Tokenre
Feature activation+0.000
not
Token not
Feature activation+0.000
dissu
Token dissu
Feature activation+0.000
aded
Tokenaded
Feature activation+0.000
from
Token from
Feature activation+0.000
coming
Token coming
Feature activation+2.303
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
fear
Token fear
Feature activation+0.058
of
Token of
Feature activation+0.000
being
Token being
Feature activation+0.000
a
Token a
Feature activation+0.000
U
Token U
Feature activation+1.516
.
Token.
Feature activation+0.099
S
TokenS
Feature activation+1.050
.
Token.
Feature activation+0.000
Border
Token Border
Feature activation+2.130
Patrol
Token Patrol
Feature activation+0.033
facility
Token facility
Feature activation+0.358
in
Token in
Feature activation+0.000
Tucson
Token Tucson
Feature activation+0.000
,
Token,
Feature activation+0.000
decide
Token decide
Feature activation+0.000
that
Token that
Feature activation+0.000
the
Token the
Feature activation+0.000
time
Token time
Feature activation+0.000
had
Token had
Feature activation+0.000
come
Token come
Feature activation+2.104
to
Token to
Feature activation+0.000
give
Token give
Feature activation+0.000
workers
Token workers
Feature activation+0.000
a
Token a
Feature activation+0.000
good
Token good
Feature activation+0.000
want
Token want
Feature activation+0.000
to
Token to
Feature activation+0.000
throw
Token throw
Feature activation+0.000
every
Token every
Feature activation+0.000
last
Token last
Feature activation+0.000
illegal
Token illegal
Feature activation+2.253
alien
Token alien
Feature activation+0.000
into
Token into
Feature activation+0.223
some
Token some
Feature activation+0.000
cattle
Token cattle
Feature activation+0.000
car
Token car
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
towel
Token towel
Feature activation+0.000
on
Token on
Feature activation+0.000
policing
Token policing
Feature activation+0.000
illegal
Token illegal
Feature activation+2.122
hotels
Token hotels
Feature activation+0.000
catering
Token catering
Feature activation+0.000
mostly
Token mostly
Feature activation+0.000
to
Token to
Feature activation+0.000
offshore
Token offshore
Feature activation+0.301

INTERVAL 1.708 - 2.049
CONTAINS 0.002%

be
Token be
Feature activation+0.000
given
Token given
Feature activation+0.000
more
Token more
Feature activation+0.000
powers
Token powers
Feature activation+0.000
.
Token.
Feature activation+0.000
Foreign
Token Foreign
Feature activation+1.857
nationals
Token nationals
Feature activation+0.612
who
Token who
Feature activation+0.115
commit
Token commit
Feature activation+0.000
serious
Token serious
Feature activation+0.000
crimes
Token crimes
Feature activation+0.000
make
Token make
Feature activation+0.154
it
Token it
Feature activation+0.099
easier
Token easier
Feature activation+0.000
to
Token to
Feature activation+0.000
deport
Token deport
Feature activation+0.772
people
Token people
Feature activation+1.879
who
Token who
Feature activation+0.437
do
Token do
Feature activation+0.000
not
Token not
Feature activation+0.177
have
Token have
Feature activation+0.232
the
Token the
Feature activation+0.000
illegal
Token illegal
Feature activation+0.000
immigrant
Token immigrant
Feature activation+0.000
participating
Token participating
Feature activation+0.000
in
Token in
Feature activation+0.000
an
Token an
Feature activation+0.000
illegal
Token illegal
Feature activation+1.752
protest
Token protest
Feature activation+0.000
inside
Token inside
Feature activation+0.000
the
Token the
Feature activation+0.000
US
Token US
Feature activation+0.952
is
Token is
Feature activation+0.000
determined
Token determined
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
in
Token in
Feature activation+0.004
the
Token the
Feature activation+0.000
country
Token country
Feature activation+1.989
illegally
Token illegally
Feature activation+2.333
,
Token,
Feature activation+0.000
will
Token will
Feature activation+0.000
often
Token often
Feature activation+0.000
only
Token only
Feature activation+0.000
who
Token who
Feature activation+0.000
were
Token were
Feature activation+0.000
soon
Token soon
Feature activation+0.000
followed
Token followed
Feature activation+0.000
by
Token by
Feature activation+0.000
Spanish
Token Spanish
Feature activation+1.966
,
Token,
Feature activation+0.000
Bulgarian
Token Bulgarian
Feature activation+0.422
,
Token,
Feature activation+0.000
Czech
Token Czech
Feature activation+0.964
,
Token,
Feature activation+0.000

INTERVAL 1.366 - 1.708
CONTAINS 0.003%

âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
without
Tokenwithout
Feature activation+0.057
any
Token any
Feature activation+0.000
interference
Token interference
Feature activation+0.056
from
Token from
Feature activation+1.402
the
Token the
Feature activation+0.000
president
Token president
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
White
Token White
Feature activation+0.173
are
Token are
Feature activation+0.000
targeting
Token targeting
Feature activation+0.000
the
Token the
Feature activation+0.000
U
Token U
Feature activation+1.205
.
Token.
Feature activation+0.000
S
TokenS
Feature activation+1.540
.-
Token.-
Feature activation+0.000
born
Tokenborn
Feature activation+0.000
children
Token children
Feature activation+0.189
of
Token of
Feature activation+0.000
undocumented
Token undocumented
Feature activation+1.048
1965
Token 1965
Feature activation+0.000
immigration
Token immigration
Feature activation+0.000
reform
Token reform
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
opened
Token opened
Feature activation+1.413
the
Token the
Feature activation+0.000
flood
Token flood
Feature activation+0.172
g
Tokeng
Feature activation+0.000
ates
Tokenates
Feature activation+0.000
to
Token to
Feature activation+0.027
who
Token who
Feature activation+0.000
is
Token is
Feature activation+0.000
yet
Token yet
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
formally
Token formally
Feature activation+1.586
identified
Token identified
Feature activation+0.000
,
Token,
Feature activation+0.000
was
Token was
Feature activation+0.000
found
Token found
Feature activation+0.000
lying
Token lying
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
But
TokenBut
Feature activation+0.000
the
Token the
Feature activation+0.000
agreement
Token agreement
Feature activation+0.000
on
Token on
Feature activation+0.000
border
Token border
Feature activation+1.469
security
Token security
Feature activation+0.000
e
Token e
Feature activation+0.000
ases
Tokenases
Feature activation+0.000
passage
Token passage
Feature activation+0.248
next
Token next
Feature activation+0.000

INTERVAL 1.025 - 1.366
CONTAINS 0.005%

security
Token security
Feature activation+0.000
officials
Token officials
Feature activation+0.306
said
Token said
Feature activation+0.000
the
Token the
Feature activation+0.000
White
Token White
Feature activation+1.675
House
Token House
Feature activation+1.315
was
Token was
Feature activation+0.000
referring
Token referring
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
number
Token number
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
automatic
Token automatic
Feature activation+0.000
three
Token three
Feature activation+0.000
million
Token million
Feature activation+0.807
permanent
Token permanent
Feature activation+1.168
increase
Token increase
Feature activation+0.000
;
Token;
Feature activation+0.000
so
Token so
Feature activation+0.000
DACA
Token DACA
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
value
Token value
Feature activation+0.000
that
Token that
Feature activation+0.000
this
Token this
Feature activation+0.000
generation
Token generation
Feature activation+0.000
can
Token can
Feature activation+0.000
bring
Token bring
Feature activation+1.149
is
Token is
Feature activation+0.000
prag
Token prag
Feature activation+0.000
mat
Tokenmat
Feature activation+0.000
ism
Tokenism
Feature activation+0.000
.
Token.
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
means
Token means
Feature activation+0.000
that
Token that
Feature activation+0.000
simply
Token simply
Feature activation+0.000
accepting
Token accepting
Feature activation+1.086
Obama
Token Obama
Feature activation+0.000
's
Token's
Feature activation+0.000
priorities
Token priorities
Feature activation+0.000
in
Token in
Feature activation+0.000
to
Token to
Feature activation+0.000
ization
Tokenization
Feature activation+0.000
of
Token of
Feature activation+0.000
immigration
Token immigration
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
United
Token United
Feature activation+1.209
States
Token States
Feature activation+1.220
.
Token.
Feature activation+0.000
However
Token However
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000

INTERVAL 0.683 - 1.025
CONTAINS 0.006%

codes
Token codes
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
U
Token U
Feature activation+0.035
.
Token.
Feature activation+0.000
S
TokenS
Feature activation+0.924
.
Token.
Feature activation+0.000
were
Token were
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
associated
Tokenassociated
Feature activation+0.000
are
Token are
Feature activation+0.000
coming
Token coming
Feature activation+1.625
into
Token into
Feature activation+1.423
the
Token the
Feature activation+0.000
United
Token United
Feature activation+0.544
States
Token States
Feature activation+0.911
,
Token,
Feature activation+0.000
they
Token they
Feature activation+0.000
are
Token are
Feature activation+0.000
immediately
Token immediately
Feature activation+0.000
going
Token going
Feature activation+0.000
by
Token by
Feature activation+0.000
Spanish
Token Spanish
Feature activation+1.966
,
Token,
Feature activation+0.000
Bulgarian
Token Bulgarian
Feature activation+0.422
,
Token,
Feature activation+0.000
Czech
Token Czech
Feature activation+0.964
,
Token,
Feature activation+0.000
Yugoslav
Token Yugoslav
Feature activation+0.882
and
Token and
Feature activation+0.000
other
Token other
Feature activation+0.000
European
Token European
Feature activation+0.178
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.154
it
Token it
Feature activation+0.099
easier
Token easier
Feature activation+0.000
to
Token to
Feature activation+0.000
deport
Token deport
Feature activation+0.772
people
Token people
Feature activation+1.879
who
Token who
Feature activation+0.437
do
Token do
Feature activation+0.000
not
Token not
Feature activation+0.177
have
Token have
Feature activation+0.232
unilateral
Token unilateral
Feature activation+0.000
decision
Token decision
Feature activation+0.000
will
Token will
Feature activation+0.000
cost
Token cost
Feature activation+0.000
the
Token the
Feature activation+0.000
people
Token people
Feature activation+0.887
of
Token of
Feature activation+0.000
Travis
Token Travis
Feature activation+0.261
County
Token County
Feature activation+0.000
money
Token money
Feature activation+0.000
that
Token that
Feature activation+0.000

INTERVAL 0.342 - 0.683
CONTAINS 0.015%

of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
threat
Token threat
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
environment
Token environment
Feature activation+0.507
,
Token,
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
pace
Token pace
Feature activation+0.000
at
Token at
Feature activation+0.000
force
Token force
Feature activation+0.000
from
Token from
Feature activation+0.233
other
Token other
Feature activation+0.000
areas
Token areas
Feature activation+0.000
in
Token in
Feature activation+0.000
Argentina
Token Argentina
Feature activation+0.504
as
Token as
Feature activation+0.000
well
Token well
Feature activation+0.000
as
Token as
Feature activation+0.000
from
Token from
Feature activation+0.292
neighboring
Token neighboring
Feature activation+0.000
this
Token this
Feature activation+0.000
have
Token have
Feature activation+0.000
to
Token to
Feature activation+0.000
do
Token do
Feature activation+0.000
with
Token with
Feature activation+0.000
Norwegian
Token Norwegian
Feature activation+0.493
Christian
Token Christian
Feature activation+0.000
terrorist
Token terrorist
Feature activation+0.000
Anders
Token Anders
Feature activation+0.000
Beh
Token Beh
Feature activation+0.000
ring
Tokenring
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
has
Token has
Feature activation+0.000
worked
Token worked
Feature activation+0.000
in
Token in
Feature activation+0.000
immigration
Token immigration
Feature activation+0.351
law
Token law
Feature activation+0.000
for
Token for
Feature activation+0.052
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000
25
Token 25
Feature activation+0.000
Studies
Token Studies
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
group
Token group
Feature activation+0.000
included
Token included
Feature activation+0.000
aliens
Token aliens
Feature activation+0.547
convicted
Token convicted
Feature activation+0.000
of
Token of
Feature activation+0.000
hundreds
Token hundreds
Feature activation+0.036
of
Token of
Feature activation+0.000
violent
Token violent
Feature activation+0.000

INTERVAL 0.000 - 0.342
CONTAINS 99.967%

ownership
Token ownership
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
OS
Token OS
Feature activation+0.000
behaviour
Token behaviour
Feature activation+0.000
and
Token and
Feature activation+0.000
functionality
Token functionality
Feature activation+0.000
on
Token on
Feature activation+0.000
Apple
Token Apple
Feature activation+0.000
machines
Token machines
Feature activation+0.000
.
Token.
Feature activation+0.000
start
Token start
Feature activation+0.000
using
Token using
Feature activation+0.000
those
Token those
Feature activation+0.000
numbers
Token numbers
Feature activation+0.000
as
Token as
Feature activation+0.000
describ
Token describ
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
of
Token of
Feature activation+0.000
value
Token value
Feature activation+0.000
.
Token.
Feature activation+0.000
They
Token They
Feature activation+0.000
Sam
Token Sam
Feature activation+0.000
oy
Tokenoy
Feature activation+0.000
eds
Tokeneds
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
N
Token N
Feature activation+0.000
en
Tokenen
Feature activation+0.000
ets
Tokenets
Feature activation+0.000
,
Token,
Feature activation+0.000
Sel
Token Sel
Feature activation+0.000
k
Tokenk
Feature activation+0.000
ups
Tokenups
Feature activation+0.000
explained
Token explained
Feature activation+0.000
Bans
Token Bans
Feature activation+0.000
al
Tokenal
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
departure
Token departure
Feature activation+0.000
as
Token as
Feature activation+0.000
an
Token an
Feature activation+0.000
outcome
Token outcome
Feature activation+0.000
of
Token of
Feature activation+0.000
huge
Token huge
Feature activation+0.000
fortune
Token fortune
Feature activation+0.000
before
Token before
Feature activation+0.000
his
Token his
Feature activation+0.000
death
Token death
Feature activation+0.000
Credit
Token Credit
Feature activation+0.000
:
Token:
Feature activation+0.000
Er
TokenEr
Feature activation+0.000
in
Tokenin
Feature activation+0.000
Jon
Token Jon
Feature activation+0.000
ass
Tokenass
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 9 in H1.6: (feature 8008

TOP ACTIVATIONS
MAX = 2.908

fur
Tokenfur
Feature activation+0.000
th
Tokenth
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+2.512
a
Token a
Feature activation+0.000
professor
Token professor
Feature activation+0.000
of
Token of
Feature activation+0.000
international
Token international
Feature activation+0.000
affairs
Token affairs
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Other
TokenOther
Feature activation+0.000
changes
Token changes
Feature activation+0.357
made
Token made
Feature activation+2.383
for
Token for
Feature activation+0.087
the
Token the
Feature activation+0.000
D
Token D
Feature activation+0.000
UAL
TokenUAL
Feature activation+0.000
SH
TokenSH
Feature activation+0.000
20
Token 20
Feature activation+0.000
%
Token%
Feature activation+0.000
reduction
Token reduction
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
original
Token original
Feature activation+2.318
award
Token award
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
How
TokenHow
Feature activation+0.000
sex
Token sex
Feature activation+0.000
ratio
Token ratio
Feature activation+0.000
at
Token at
Feature activation+0.000
birth
Token birth
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+2.280
pers
Token pers
Feature activation+0.000
isting
Tokenisting
Feature activation+0.000
well
Token well
Feature activation+0.000
into
Token into
Feature activation+0.308
adulthood
Token adulthood
Feature activation+0.000
Plan
Token Plan
Feature activation+0.000
(
Token (
Feature activation+0.000
ST
TokenST
Feature activation+0.000
P
TokenP
Feature activation+0.000
)
Token)
Feature activation+0.000
made
Token made
Feature activation+2.235
a
Token a
Feature activation+0.000
clear
Token clear
Feature activation+0.067
commitment
Token commitment
Feature activation+0.000
that
Token that
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.000
being
Token being
Feature activation+0.000
present
Token present
Feature activation+0.186
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
now
Token now
Feature activation+2.116
.
Token.
Feature activation+0.000
Whatever
Token Whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
do
Token do
Feature activation+0.000
creatively
Token creatively
Feature activation+0.000
changes
Token changes
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
include
Token include
Feature activation+0.000
a
Token a
Feature activation+0.000
new
Token new
Feature activation+2.082
state
Token state
Feature activation+0.000
seal
Token seal
Feature activation+0.000
,
Token,
Feature activation+0.000
were
Token were
Feature activation+0.000
being
Token being
Feature activation+0.000
her
Token her
Feature activation+0.000
eyes
Token eyes
Feature activation+0.000
ight
Tokenight
Feature activation+0.000
,
Token,
Feature activation+0.000
Olivia
Token Olivia
Feature activation+0.000
now
Token now
Feature activation+1.982
only
Token only
Feature activation+0.000
looks
Token looks
Feature activation+0.000
forward
Token forward
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
change
Token change
Feature activation+0.214
to
Token to
Feature activation+0.191
Shield
Token Shield
Feature activation+0.000
Oath
Token Oath
Feature activation+0.000
to
Token to
Feature activation+0.034
now
Token now
Feature activation+1.954
grant
Token grant
Feature activation+0.000
5
Token 5
Feature activation+0.000
%
Token%
Feature activation+0.000
more
Token more
Feature activation+0.000
damage
Token damage
Feature activation+0.000
Regulations
Token Regulations
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
new
Token new
Feature activation+1.948
rules
Token rules
Feature activation+0.000
are
Token are
Feature activation+0.000
:
Token:
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
the
Token the
Feature activation+0.000
changes
Token changes
Feature activation+0.000
approved
Token approved
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
new
Token new
Feature activation+1.898
rules
Token rules
Feature activation+0.000
have
Token have
Feature activation+0.000
now
Token now
Feature activation+1.503
been
Token been
Feature activation+0.504
declared
Token declared
Feature activation+0.000
it
Token it
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+1.893
a
Token a
Feature activation+0.000
big
Token big
Feature activation+0.000
push
Token push
Feature activation+0.000
for
Token for
Feature activation+0.000
both
Token both
Feature activation+0.000
those
Token those
Feature activation+0.000
which
Token which
Feature activation+0.000
survived
Token survived
Feature activation+0.000
adopted
Token adopted
Feature activation+0.000
the
Token the
Feature activation+0.000
new
Token new
Feature activation+1.743
open
Token open
Feature activation+0.000
,
Token,
Feature activation+0.000
global
Token global
Feature activation+0.000
standard
Token standard
Feature activation+0.000
for
Token for
Feature activation+0.000
library
Token library
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.000
still
Token still
Feature activation+1.723
the
Token the
Feature activation+0.000
tab
Token tab
Feature activation+0.000
that
Token that
Feature activation+0.000
lists
Token lists
Feature activation+0.000
all
Token all
Feature activation+0.000
Child
Token Child
Feature activation+0.000
-
Token-
Feature activation+0.000
free
Tokenfree
Feature activation+0.000
people
Token people
Feature activation+0.000
are
Token are
Feature activation+0.000
still
Token still
Feature activation+1.697
human
Token human
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Y
TokenY
Feature activation+0.000
ANG
TokenANG
Feature activation+0.000
ON
TokenON
Feature activation+0.000
increasing
Token increasing
Feature activation+0.054
.
Token.
Feature activation+0.000
There
Token There
Feature activation+0.000
is
Token is
Feature activation+0.000
no
Token no
Feature activation+0.000
new
Token new
Feature activation+1.692
burden
Token burden
Feature activation+0.000
that
Token that
Feature activation+0.000
wasn
Token wasn
Feature activation+0.000
't
Token't
Feature activation+0.000
already
Token already
Feature activation+0.955
that
Token that
Feature activation+0.000
introduces
Token introduces
Feature activation+0.000
changes
Token changes
Feature activation+0.000
like
Token like
Feature activation+0.000
a
Token a
Feature activation+0.000
new
Token new
Feature activation+1.639
feature
Token feature
Feature activation+0.000
,
Token,
Feature activation+0.000
a
Token a
Feature activation+0.000
bug
Token bug
Feature activation+0.000
fix
Token fix
Feature activation+0.278
.
Token.
Feature activation+0.000
Other
Token Other
Feature activation+0.000
changes
Token changes
Feature activation+0.000
include
Token include
Feature activation+0.000
a
Token a
Feature activation+0.000
new
Token new
Feature activation+1.629
two
Token two
Feature activation+0.000
-
Token-
Feature activation+0.000
minute
Tokenminute
Feature activation+0.000
warning
Token warning
Feature activation+0.000
at
Token at
Feature activation+0.000
check
Token check
Feature activation+0.000
that
Token that
Feature activation+0.000
bats
Token bats
Feature activation+0.000
meet
Token meet
Feature activation+0.000
the
Token the
Feature activation+0.000
new
Token new
Feature activation+1.614
regulations
Token regulations
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
what
Token what
Feature activation+0.000
the
Token the
Feature activation+0.000
box
Token box
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
original
Token original
Feature activation+1.554
Optimus
Token Optimus
Feature activation+0.000
Prime
Token Prime
Feature activation+0.000
contains
Token contains
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000

Top DFA by src position
MAX = 2.907

âĢ
TokenâĢ
Feature activation+0.002
Top resid features:
ľ
Tokenľ
Feature activation+0.001
Top resid features:
The
TokenThe
Feature activation+0.003
Top resid features:
situation
Token situation
Feature activation+0.017
Top resid features:
has
Token has
Feature activation-0.001
Top resid features:
changed
Token changed
Feature activation+2.907
Top resid features:
significantly
Token significantly
Feature activation+0.028
Top resid features:
in
Token in
Feature activation-0.003
Top resid features:
recent
Token recent
Feature activation+0.010
Top resid features:
years
Token years
Feature activation+0.037
Top resid features:
,
Token,
Feature activation-0.024
Top resid features:
improvements
Token improvements
Feature activation+0.761
Top resid features:
.
Token.
Feature activation+0.003
Top resid features:
Ċ
TokenĊ
Feature activation-0.002
Top resid features:
Ċ
TokenĊ
Feature activation+0.000
Top resid features:
Other
TokenOther
Feature activation-0.006
Top resid features:
changes
Token changes
Feature activation+2.017
Top resid features:
made
Token made
Feature activation-0.176
Top resid features:
for
Token for
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
D
Token D
Feature activation+0.000
Top resid features:
UAL
TokenUAL
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.007
Top resid features:
child
Token child
Feature activation-0.013
Top resid features:
support
Token support
Feature activation-0.009
Top resid features:
guidelines
Token guidelines
Feature activation+0.041
Top resid features:
have
Token have
Feature activation+0.006
Top resid features:
changed
Token changed
Feature activation+2.338
Top resid features:
to
Token to
Feature activation-0.015
Top resid features:
cause
Token cause
Feature activation-0.010
Top resid features:
a
Token a
Feature activation+0.013
Top resid features:
20
Token 20
Feature activation+0.004
Top resid features:
%
Token%
Feature activation-0.023
Top resid features:
Ċ
TokenĊ
Feature activation+0.002
Top resid features:
Well
TokenWell
Feature activation-0.016
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
things
Token things
Feature activation+0.026
Top resid features:
have
Token have
Feature activation+0.016
Top resid features:
changed
Token changed
Feature activation+2.828
Top resid features:
.
Token.
Feature activation-0.021
Top resid features:
The
Token The
Feature activation+0.004
Top resid features:
skewed
Token skewed
Feature activation+0.040
Top resid features:
sex
Token sex
Feature activation-0.035
Top resid features:
ratio
Token ratio
Feature activation-0.013
Top resid features:
ching
Tokenching
Feature activation+0.012
Top resid features:
âĢ
TokenâĢ
Feature activation+0.001
Top resid features:
Ļ
TokenĻ
Feature activation+0.012
Top resid features:
proposals
Token proposals
Feature activation+0.030
Top resid features:
for
Token for
Feature activation+0.015
Top resid features:
changes
Token changes
Feature activation+2.405
Top resid features:
at
Token at
Feature activation+0.012
Top resid features:
Ch
Token Ch
Feature activation+0.012
Top resid features:
aring
Tokenaring
Feature activation+0.004
Top resid features:
Cross
Token Cross
Feature activation+0.006
Top resid features:
,
Token,
Feature activation-0.013
Top resid features:
with
Token with
Feature activation+0.004
Top resid features:
many
Token many
Feature activation+0.004
Top resid features:
people
Token people
Feature activation-0.006
Top resid features:
,
Token,
Feature activation-0.015
Top resid features:
has
Token has
Feature activation+0.004
Top resid features:
changed
Token changed
Feature activation+2.572
Top resid features:
through
Token through
Feature activation+0.015
Top resid features:
the
Token the
Feature activation+0.022
Top resid features:
ages
Token ages
Feature activation+0.009
Top resid features:
.
Token.
Feature activation-0.014
Top resid features:
These
Token These
Feature activation+0.012
Top resid features:
explanation
Token explanation
Feature activation-0.029
Top resid features:
as
Token as
Feature activation-0.000
Top resid features:
to
Token to
Feature activation-0.014
Top resid features:
why
Token why
Feature activation+0.013
Top resid features:
the
Token the
Feature activation+0.017
Top resid features:
changes
Token changes
Feature activation+2.683
Top resid features:
,
Token,
Feature activation-0.015
Top resid features:
which
Token which
Feature activation+0.031
Top resid features:
include
Token include
Feature activation-0.047
Top resid features:
a
Token a
Feature activation+0.111
Top resid features:
new
Token new
Feature activation-0.145
Top resid features:
just
Token just
Feature activation-0.002
Top resid features:
so
Token so
Feature activation+0.004
Top resid features:
happy
Token happy
Feature activation+0.002
Top resid features:
.
Token.
Feature activation-0.006
Top resid features:
It
Token It
Feature activation+0.010
Top resid features:
changed
Token changed
Feature activation+2.668
Top resid features:
my
Token my
Feature activation+0.017
Top resid features:
life
Token life
Feature activation-0.002
Top resid features:
.
Token.
Feature activation-0.007
Top resid features:
âĢ
TokenâĢ
Feature activation-0.001
Top resid features:
Ŀ
TokenĿ
Feature activation+0.007
Top resid features:
not
Token not
Feature activation+0.003
Top resid features:
lost
Token lost
Feature activation+0.055
Top resid features:
,
Token,
Feature activation-0.043
Top resid features:
as
Token as
Feature activation+0.012
Top resid features:
the
Token the
Feature activation+0.043
Top resid features:
change
Token change
Feature activation+2.531
Top resid features:
to
Token to
Feature activation+0.019
Top resid features:
Shield
Token Shield
Feature activation+0.022
Top resid features:
Oath
Token Oath
Feature activation+0.058
Top resid features:
to
Token to
Feature activation+0.084
Top resid features:
now
Token now
Feature activation-0.203
Top resid features:
.
Token.
Feature activation-0.009
Top resid features:
Ċ
TokenĊ
Feature activation+0.006
Top resid features:
Ċ
TokenĊ
Feature activation+0.006
Top resid features:
With
TokenWith
Feature activation+0.010
Top resid features:
the
Token the
Feature activation+0.020
Top resid features:
changes
Token changes
Feature activation+1.077
Top resid features:
approved
Token approved
Feature activation-0.003
Top resid features:
,
Token,
Feature activation-0.014
Top resid features:
the
Token the
Feature activation+0.016
Top resid features:
new
Token new
Feature activation-0.007
Top resid features:
rules
Token rules
Feature activation+0.049
Top resid features:
.
Token.
Feature activation-0.019
Top resid features:
Ċ
TokenĊ
Feature activation+0.002
Top resid features:
Ċ
TokenĊ
Feature activation-0.001
Top resid features:
With
TokenWith
Feature activation+0.017
Top resid features:
the
Token the
Feature activation+0.024
Top resid features:
changes
Token changes
Feature activation+1.597
Top resid features:
approved
Token approved
Feature activation-0.024
Top resid features:
,
Token,
Feature activation-0.076
Top resid features:
the
Token the
Feature activation+0.123
Top resid features:
new
Token new
Feature activation-0.135
Top resid features:
rules
Token rules
Feature activation+0.000
Top resid features:
,
Token,
Feature activation-0.025
Top resid features:
their
Token their
Feature activation+0.029
Top resid features:
attitudes
Token attitudes
Feature activation+0.018
Top resid features:
also
Token also
Feature activation-0.015
Top resid features:
having
Token having
Feature activation+0.013
Top resid features:
changed
Token changed
Feature activation+2.513
Top resid features:
for
Token for
Feature activation-0.002
Top resid features:
the
Token the
Feature activation+0.043
Top resid features:
worse
Token worse
Feature activation+0.027
Top resid features:
.
Token.
Feature activation-0.027
Top resid features:
I
Token I
Feature activation+0.006
Top resid features:
rise
Token rise
Feature activation+0.070
Top resid features:
of
Token of
Feature activation-0.007
Top resid features:
the
Token the
Feature activation-0.002
Top resid features:
public
Token public
Feature activation-0.033
Top resid features:
Internet
Token Internet
Feature activation+0.002
Top resid features:
changed
Token changed
Feature activation+2.334
Top resid features:
everything
Token everything
Feature activation+0.024
Top resid features:
and
Token and
Feature activation-0.030
Top resid features:
those
Token those
Feature activation+0.007
Top resid features:
which
Token which
Feature activation+0.033
Top resid features:
survived
Token survived
Feature activation-0.020
Top resid features:
,
Token,
Feature activation-0.020
Top resid features:
all
Token all
Feature activation+0.004
Top resid features:
has
Token has
Feature activation+0.004
Top resid features:
now
Token now
Feature activation+0.013
Top resid features:
been
Token been
Feature activation+0.001
Top resid features:
changed
Token changed
Feature activation+1.986
Top resid features:
to
Token to
Feature activation-0.021
Top resid features:
library
Token library
Feature activation-0.023
Top resid features:
,
Token,
Feature activation-0.026
Top resid features:
and
Token and
Feature activation-0.078
Top resid features:
there
Token there
Feature activation+0.064
Top resid features:
later
Token later
Feature activation+0.009
Top resid features:
,
Token,
Feature activation-0.010
Top resid features:
and
Token and
Feature activation-0.001
Top resid features:
nothing
Token nothing
Feature activation+0.001
Top resid features:
has
Token has
Feature activation+0.008
Top resid features:
changed
Token changed
Feature activation+1.352
Top resid features:
.
Token.
Feature activation-0.004
Top resid features:
âĢ
TokenâĢ
Feature activation+0.008
Top resid features:
Ŀ
TokenĿ
Feature activation+0.005
Top resid features:
Ċ
TokenĊ
Feature activation-0.019
Top resid features:
Ċ
TokenĊ
Feature activation-0.021
Top resid features:
1994
Token 1994
Feature activation+0.020
Top resid features:
.
Token.
Feature activation-0.006
Top resid features:
It
Token It
Feature activation+0.014
Top resid features:
's
Token's
Feature activation+0.018
Top resid features:
not
Token not
Feature activation+0.017
Top resid features:
changing
Token changing
Feature activation+1.311
Top resid features:
.
Token.
Feature activation-0.002
Top resid features:
It
Token It
Feature activation+0.019
Top resid features:
's
Token's
Feature activation+0.021
Top resid features:
not
Token not
Feature activation+0.033
Top resid features:
increasing
Token increasing
Feature activation+0.305
Top resid features:
-
Token -
Feature activation-0.014
Top resid features:
Any
Token Any
Feature activation-0.001
Top resid features:
branch
Token branch
Feature activation+0.026
Top resid features:
that
Token that
Feature activation+0.000
Top resid features:
introduces
Token introduces
Feature activation+0.086
Top resid features:
changes
Token changes
Feature activation+0.928
Top resid features:
like
Token like
Feature activation-0.006
Top resid features:
a
Token a
Feature activation+0.093
Top resid features:
new
Token new
Feature activation-0.198
Top resid features:
feature
Token feature
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
-
Token-
Feature activation-0.017
Top resid features:
yard
Tokenyard
Feature activation-0.045
Top resid features:
line
Token line
Feature activation+0.013
Top resid features:
.
Token.
Feature activation-0.043
Top resid features:
Other
Token Other
Feature activation-0.003
Top resid features:
changes
Token changes
Feature activation+2.278
Top resid features:
include
Token include
Feature activation-0.103
Top resid features:
a
Token a
Feature activation+0.126
Top resid features:
new
Token new
Feature activation-0.091
Top resid features:
two
Token two
Feature activation+0.000
Top resid features:
-
Token-
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.007
Top resid features:
Ċ
TokenĊ
Feature activation+0.006
Top resid features:
There
TokenThere
Feature activation+0.013
Top resid features:
are
Token are
Feature activation+0.018
Top resid features:
no
Token no
Feature activation+0.009
Top resid features:
changes
Token changes
Feature activation+2.092
Top resid features:
to
Token to
Feature activation+0.006
Top resid features:
the
Token the
Feature activation+0.025
Top resid features:
permitted
Token permitted
Feature activation+0.002
Top resid features:
width
Token width
Feature activation+0.006
Top resid features:
and
Token and
Feature activation+0.004
Top resid features:
ines
Tokenines
Feature activation+0.017
Top resid features:
.
Token.
Feature activation-0.009
Top resid features:
So
Token So
Feature activation+0.016
Top resid features:
the
Token the
Feature activation+0.031
Top resid features:
picture
Token picture
Feature activation-0.021
Top resid features:
changes
Token changes
Feature activation+2.055
Top resid features:
as
Token as
Feature activation+0.009
Top resid features:
you
Token you
Feature activation+0.010
Top resid features:
hold
Token hold
Feature activation-0.006
Top resid features:
the
Token the
Feature activation+0.021
Top resid features:
cards
Token cards
Feature activation+0.003
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.04

Head 2: 0.04

Head 3: 0.02

Head 4: 0.05

Head 5: 0.06

Head 6: 0.52

Head 7: 0.06

Head 8: 0.03

Head 9: 0.03

Head 10: 0.03

Head 11: 0.07

Positive logits

Changes2.40

changes2.09

redesign2.08

drastically1.98

reverted1.98

Differences1.88

changes1.87

radically1.86

Balance1.86

revert1.84

��1.84

merged1.78

Changes1.78

Kitt1.77

leveled1.74

decre1.71

modified1.71

outdated1.71

redesigned1.71

��1.70

Negative logits

cade-2.14

pd-2.07

olicited-1.85

tu-1.75

ideo-1.71

venture-1.70

lehem-1.70

liga-1.68

venture-1.68

cam-1.67

afer-1.67

ecd-1.65

aleb-1.64

defense-1.64

entious-1.63

lov-1.62

anamo-1.61

Fre-1.61

ofi-1.61

abl-1.60

INTERVAL 2.617 - 2.908
CONTAINS 0.000%

INTERVAL 2.326 - 2.617
CONTAINS 0.000%

fur
Tokenfur
Feature activation+0.000
th
Tokenth
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+2.512
a
Token a
Feature activation+0.000
professor
Token professor
Feature activation+0.000
of
Token of
Feature activation+0.000
international
Token international
Feature activation+0.000
affairs
Token affairs
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Other
TokenOther
Feature activation+0.000
changes
Token changes
Feature activation+0.357
made
Token made
Feature activation+2.383
for
Token for
Feature activation+0.087
the
Token the
Feature activation+0.000
D
Token D
Feature activation+0.000
UAL
TokenUAL
Feature activation+0.000
SH
TokenSH
Feature activation+0.000

INTERVAL 2.036 - 2.326
CONTAINS 0.001%

Plan
Token Plan
Feature activation+0.000
(
Token (
Feature activation+0.000
ST
TokenST
Feature activation+0.000
P
TokenP
Feature activation+0.000
)
Token)
Feature activation+0.000
made
Token made
Feature activation+2.235
a
Token a
Feature activation+0.000
clear
Token clear
Feature activation+0.067
commitment
Token commitment
Feature activation+0.000
that
Token that
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.000
being
Token being
Feature activation+0.000
present
Token present
Feature activation+0.186
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
now
Token now
Feature activation+2.116
.
Token.
Feature activation+0.000
Whatever
Token Whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
do
Token do
Feature activation+0.000
creatively
Token creatively
Feature activation+0.000
sex
Token sex
Feature activation+0.000
ratio
Token ratio
Feature activation+0.000
at
Token at
Feature activation+0.000
birth
Token birth
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+2.280
pers
Token pers
Feature activation+0.000
isting
Tokenisting
Feature activation+0.000
well
Token well
Feature activation+0.000
into
Token into
Feature activation+0.308
adulthood
Token adulthood
Feature activation+0.000
20
Token 20
Feature activation+0.000
%
Token%
Feature activation+0.000
reduction
Token reduction
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
original
Token original
Feature activation+2.318
award
Token award
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
How
TokenHow
Feature activation+0.000
changes
Token changes
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
include
Token include
Feature activation+0.000
a
Token a
Feature activation+0.000
new
Token new
Feature activation+2.082
state
Token state
Feature activation+0.000
seal
Token seal
Feature activation+0.000
,
Token,
Feature activation+0.000
were
Token were
Feature activation+0.000
being
Token being
Feature activation+0.000

INTERVAL 1.745 - 2.036
CONTAINS 0.001%

change
Token change
Feature activation+0.214
to
Token to
Feature activation+0.191
Shield
Token Shield
Feature activation+0.000
Oath
Token Oath
Feature activation+0.000
to
Token to
Feature activation+0.034
now
Token now
Feature activation+1.954
grant
Token grant
Feature activation+0.000
5
Token 5
Feature activation+0.000
%
Token%
Feature activation+0.000
more
Token more
Feature activation+0.000
damage
Token damage
Feature activation+0.000
it
Token it
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+1.893
a
Token a
Feature activation+0.000
big
Token big
Feature activation+0.000
push
Token push
Feature activation+0.000
for
Token for
Feature activation+0.000
both
Token both
Feature activation+0.000
the
Token the
Feature activation+0.000
changes
Token changes
Feature activation+0.000
approved
Token approved
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
new
Token new
Feature activation+1.898
rules
Token rules
Feature activation+0.000
have
Token have
Feature activation+0.000
now
Token now
Feature activation+1.503
been
Token been
Feature activation+0.504
declared
Token declared
Feature activation+0.000
Regulations
Token Regulations
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
new
Token new
Feature activation+1.948
rules
Token rules
Feature activation+0.000
are
Token are
Feature activation+0.000
:
Token:
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
her
Token her
Feature activation+0.000
eyes
Token eyes
Feature activation+0.000
ight
Tokenight
Feature activation+0.000
,
Token,
Feature activation+0.000
Olivia
Token Olivia
Feature activation+0.000
now
Token now
Feature activation+1.982
only
Token only
Feature activation+0.000
looks
Token looks
Feature activation+0.000
forward
Token forward
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000

INTERVAL 1.454 - 1.745
CONTAINS 0.002%

is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
story
Token story
Feature activation+0.000
many
Token many
Feature activation+0.000
have
Token have
Feature activation+0.000
now
Token now
Feature activation+1.468
.
Token.
Feature activation+0.000
For
Token For
Feature activation+0.000
20
Token 20
Feature activation+0.000
years
Token years
Feature activation+0.000
,
Token,
Feature activation+0.000
shelves
Token shelves
Feature activation+0.000
.
Token.
Feature activation+0.000
As
Token As
Feature activation+0.000
of
Token of
Feature activation+0.000
right
Token right
Feature activation+0.000
now
Token now
Feature activation+1.458
,
Token,
Feature activation+0.000
sour
Token sour
Feature activation+0.000
beers
Token beers
Feature activation+0.000
may
Token may
Feature activation+0.000
still
Token still
Feature activation+1.136
the
Token the
Feature activation+0.000
F
Token F
Feature activation+0.000
-
Token-
Feature activation+0.000
35
Token35
Feature activation+0.000
program
Token program
Feature activation+0.000
since
Token since
Feature activation+1.537
Trump
Token Trump
Feature activation+0.000
took
Token took
Feature activation+0.043
office
Token office
Feature activation+0.000
.
Token.
Feature activation+0.000
Most
Token Most
Feature activation+0.000
increasing
Token increasing
Feature activation+0.054
.
Token.
Feature activation+0.000
There
Token There
Feature activation+0.000
is
Token is
Feature activation+0.000
no
Token no
Feature activation+0.000
new
Token new
Feature activation+1.692
burden
Token burden
Feature activation+0.000
that
Token that
Feature activation+0.000
wasn
Token wasn
Feature activation+0.000
't
Token't
Feature activation+0.000
already
Token already
Feature activation+0.955
said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
new
Token new
Feature activation+1.456
state
Token state
Feature activation+0.000
flag
Token flag
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Republic
Token Republic
Feature activation+0.000

INTERVAL 1.163 - 1.454
CONTAINS 0.002%

demanding
Token demanding
Feature activation+0.000
that
Token that
Feature activation+0.000
every
Token every
Feature activation+0.000
public
Token public
Feature activation+0.000
school
Token school
Feature activation+0.000
now
Token now
Feature activation+1.213
allow
Token allow
Feature activation+0.000
grown
Token grown
Feature activation+0.000
men
Token men
Feature activation+0.000
and
Token and
Feature activation+0.000
boys
Token boys
Feature activation+0.000
would
Token would
Feature activation+0.000
have
Token have
Feature activation+0.000
to
Token to
Feature activation+0.000
see
Token see
Feature activation+0.000
the
Token the
Feature activation+0.000
original
Token original
Feature activation+1.367
calculations
Token calculations
Feature activation+0.000
presented
Token presented
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
court
Token court
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
These
TokenThese
Feature activation+0.000
new
Token new
Feature activation+1.362
rules
Token rules
Feature activation+0.000
I
Token I
Feature activation+0.000
hear
Token hear
Feature activation+0.000
about
Token about
Feature activation+0.000
are
Token are
Feature activation+0.000
warm
Token warm
Feature activation+0.000
2015
Token 2015
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
is
Token is
Feature activation+0.000
now
Token now
Feature activation+1.410
being
Token being
Feature activation+0.000
followed
Token followed
Feature activation+0.000
by
Token by
Feature activation+0.000
a
Token a
Feature activation+0.000
record
Token record
Feature activation+0.000
hard
Token hard
Feature activation+0.000
to
Token to
Feature activation+0.000
achieve
Token achieve
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
current
Token current
Feature activation+1.403
bitcoin
Token bitcoin
Feature activation+0.000
for
Token for
Feature activation+0.000
king
Tokenking
Feature activation+0.000
discussions
Token discussions
Feature activation+0.000
show
Token show
Feature activation+0.000

INTERVAL 0.872 - 1.163
CONTAINS 0.003%

in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.009
game
Token game
Feature activation+0.000
that
Token that
Feature activation+0.000
might
Token might
Feature activation+0.000
make
Token make
Feature activation+1.031
someone
Token someone
Feature activation+0.000
uncomfortable
Token uncomfortable
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
.
Token.
Feature activation+0.000
July
Token July
Feature activation+0.000
.
Token.
Feature activation+0.000
But
Token But
Feature activation+0.000
after
Token after
Feature activation+0.038
the
Token the
Feature activation+0.000
new
Token new
Feature activation+1.141
U
Token U
Feature activation+0.000
.
Token.
Feature activation+0.000
S
TokenS
Feature activation+0.000
.
Token.
Feature activation+0.000
sanctions
Token sanctions
Feature activation+0.000
internally
Token internally
Feature activation+0.000
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
decided
Token decided
Feature activation+0.087
to
Token to
Feature activation+0.343
remove
Token remove
Feature activation+1.145
that
Token that
Feature activation+0.000
because
Token because
Feature activation+0.107
we
Token we
Feature activation+0.000
want
Token want
Feature activation+0.000
the
Token the
Feature activation+0.075
lot
Token lot
Feature activation+0.000
over
Token over
Feature activation+0.000
the
Token the
Feature activation+0.000
last
Token last
Feature activation+0.234
season
Token season
Feature activation+0.000
from
Token from
Feature activation+0.892
a
Token a
Feature activation+0.000
c
Token c
Feature activation+0.000
ower
Tokenower
Feature activation+0.000
ing
Tokening
Feature activation+0.000
young
Token young
Feature activation+0.000
and
Token and
Feature activation+0.000
it
Token it
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
become
Token become
Feature activation+0.933
common
Token common
Feature activation+0.038
to
Token to
Feature activation+0.266
find
Token find
Feature activation+0.000
sour
Token sour
Feature activation+0.000
beers
Token beers
Feature activation+0.000

INTERVAL 0.582 - 0.872
CONTAINS 0.008%

discuss
Token discuss
Feature activation+0.000
the
Token the
Feature activation+0.000
future
Token future
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
planet
Token planet
Feature activation+0.743
,
Token,
Feature activation+0.000
some
Token some
Feature activation+0.000
are
Token are
Feature activation+0.000
beginning
Token beginning
Feature activation+0.000
to
Token to
Feature activation+0.000
This
Token This
Feature activation+0.000
was
Token was
Feature activation+0.000
really
Token really
Feature activation+0.000
very
Token very
Feature activation+0.000
removed
Token removed
Feature activation+0.815
from
Token from
Feature activation+0.820
the
Token the
Feature activation+0.000
physical
Token physical
Feature activation+0.000
violence
Token violence
Feature activation+0.000
going
Token going
Feature activation+0.000
on
Token on
Feature activation+0.000
Mormon
Token Mormon
Feature activation+0.000
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
It
Token It
Feature activation+0.000
made
Token made
Feature activation+0.704
me
Token me
Feature activation+0.000
think
Token think
Feature activation+0.000
about
Token about
Feature activation+0.000
doctr
Token doctr
Feature activation+0.000
inal
Tokeninal
Feature activation+0.000
up
Token up
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
of
Token of
Feature activation+0.000
bringing
Token bringing
Feature activation+0.000
new
Token new
Feature activation+0.797
features
Token features
Feature activation+0.000
to
Token to
Feature activation+0.000
Bitcoin
Token Bitcoin
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
away
Tokenaway
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
cost
Token cost
Feature activation+0.000
.
Token.
Feature activation+0.000
Back
Token Back
Feature activation+0.610
in
Token in
Feature activation+0.000
November
Token November
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
DOD
Token DOD
Feature activation+0.000

INTERVAL 0.291 - 0.582
CONTAINS 0.017%

Ċ
TokenĊ
Feature activation+0.000
Because
TokenBecause
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
differences
Token differences
Feature activation+0.000
between
Token between
Feature activation+0.426
Unity
Token Unity
Feature activation+0.000
4
Token 4
Feature activation+0.000
and
Token and
Feature activation+0.000
5
Token 5
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
guise
Token guise
Feature activation+0.000
of
Token of
Feature activation+0.000
Oracle
Token Oracle
Feature activation+0.000
and
Token and
Feature activation+0.000
become
Token become
Feature activation+0.350
,
Token,
Feature activation+0.000
arguably
Token arguably
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
primer
Token primer
Feature activation+0.000
ard
Tokenard
Feature activation+0.000
as
Token as
Feature activation+0.000
I
Token I
Feature activation+0.000
have
Token have
Feature activation+0.000
done
Token done
Feature activation+0.000
since
Token since
Feature activation+0.477
first
Token first
Feature activation+0.000
being
Token being
Feature activation+0.000
elected
Token elected
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
ir
Tokenir
Feature activation+0.000
ate
Tokenate
Feature activation+0.000
is
Token is
Feature activation+0.000
transforming
Token transforming
Feature activation+0.000
itself
Token itself
Feature activation+0.000
into
Token into
Feature activation+0.485
a
Token a
Feature activation+0.000
sporting
Token sporting
Feature activation+0.000
superpower
Token superpower
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Bill
Token Bill
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
This
TokenThis
Feature activation+0.000
will
Token will
Feature activation+0.000
introduce
Token introduce
Feature activation+0.335
a
Token a
Feature activation+0.000
single
Token single
Feature activation+0.000
-
Token-
Feature activation+0.000
tier
Tokentier
Feature activation+0.000
state
Token state
Feature activation+0.000

INTERVAL 0.000 - 0.291
CONTAINS 99.967%

played
Token played
Feature activation+0.000
golf
Token golf
Feature activation+0.000
and
Token and
Feature activation+0.000
basketball
Token basketball
Feature activation+0.000
at
Token at
Feature activation+0.000
Ball
Token Ball
Feature activation+0.000
State
Token State
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Call
TokenCall
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
not
Token not
Feature activation+0.000
doing
Token doing
Feature activation+0.000
at
Token at
Feature activation+0.000
all
Token all
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
But
TokenBut
Feature activation+0.000
data
Token data
Feature activation+0.000
that
Token that
Feature activation+0.000
the
Token the
Feature activation+0.000
National
Token National
Feature activation+0.000
Security
Token Security
Feature activation+0.000
Agency
Token Agency
Feature activation+0.000
(
Token (
Feature activation+0.000
NSA
TokenNSA
Feature activation+0.000
)
Token)
Feature activation+0.000
pulls
Token pulls
Feature activation+0.000
data
Token data
Feature activation+0.000
of
Token of
Feature activation+0.000
Americans
Token Americans
Feature activation+0.000
'
Token'
Feature activation+0.000
data
Token data
Feature activation+0.000
is
Token is
Feature activation+0.000
also
Token also
Feature activation+0.000
collected
Token collected
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
much
Token much
Feature activation+0.000
as
Token as
Feature activation+0.000
they
Token they
Feature activation+0.000
have
Token have
Feature activation+0.000
,"
Token,"
Feature activation+0.000
K
Token K
Feature activation+0.000
av
Tokenav
Feature activation+0.000
uru
Tokenuru
Feature activation+0.000
said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000